The prerequisite for this course is CS35. There are no biology prerequisites for this course. The goal of this course is to introduce foundational algorithms that have become essential for learning from biological data. With the genome sequencing revolution of the last 20 years, it has become easier and cheaper to obtain genetic data, but often overwhelming to store, analyze, and make sense of this data. These issues have both driven new algorithm development and repurposed existing algorithms for biology.
We will study both types of algorithms, with a focus on the scientific method. By the end of this course, you should be able to ask a biological question, form a hypothesis about the answer, design a computational experiment to test your hypothesis, implement and execute the experiment, iterate your design and implementation based on the results, and finally interpret the results to form a biologically relevant conclusion. We will focus on synthetic and publicly available datasets, not generating new data.
The language for this course is Python 3.
WEEK | DAY | ANNOUNCEMENTS | TOPIC & READING | LABS |
1 | Jan 22 | Introduction to Bioinformatics and Molecular Biology
Reading:
| Mon: Wed: Fri: Lab 1: Working with sequences | |
Jan 24 | ||||
Jan 26 | ||||
2 | Jan 29 | Genome Assembly
Reading:
| Mon: Wed: Fri: Lab 2: Genome assembly | |
Jan 31 | ||||
Feb 02 | Drop/add ends | |||
3 | Feb 05 | Pairwise Sequence Alignment
Reading:
| Mon: Wed: Fri: Lab 3: Pairwise sequence alignment | |
Feb 07 | ||||
Feb 09 | ||||
4 | Feb 12 | BWT and Read Mapping
Reading:
| Mon: Wed: Fri: Lab 4: BWT and read mapping | |
Feb 14 | ||||
Feb 16 | ||||
5 | Feb 19 | Phylogenetic Trees
Reading:
| Mon: Wed: Fri:
In-lab practice midterm | |
Feb 21 | ||||
Feb 23 | ||||
6 | Feb 26 | Phylogenetic Trees
Reading
| Mon: Wed: Fri: In-lab Midterm 1 | |
Feb 28 | ||||
Mar 02 | ||||
7 | Mar 05 | Ancestral state reconstruction
Reading
| Mon: Thurs: Fri: Lab 5: Phylogenetic Trees | |
Mar 07 | ||||
Mar 09 | ||||
Mar 12 | Spring Break | |||
Mar 14 | ||||
Mar 16 | ||||
8 | Mar 19 | Population Genetics
Reading
| Mon: Wed: Fri: Lab 6: Perfect Phylogeny | |
Mar 21 | ||||
Mar 23 | ||||
9 | Mar 26 | Hidden Markov Models 1
Reading
| Mon: Wed: Fri: Lab 7: Population Genetics | |
Mar 28 | ||||
Mar 30 | CR/NC/W Deadline | |||
10 | Apr 02 | Hidden Markov Models 2
Reading
| Mon: Wed: Fri: Lab 8: Hidden Markov Models | |
Apr 04 | ||||
Apr 06 | ||||
11 | Apr 09 | Principal Components Analysis
Reading:
| Mon: Wed: Fri: Lab 9: PCA | |
Apr 11 | ||||
Apr 13 | ||||
12 | Apr 16 | Midterm Review
| Mon: Wed: Fri: Project: Proposal | |
Apr 18 | ||||
Apr 20 | ||||
13 | Apr 23 | Special topics: GWAS and Deep Learning
| Mon: Wed: Fri: In-lab Midterm 2 | |
Apr 25 | ||||
Apr 27 | ||||
14 | Apr 30 | Special topic: Ethics and the Genome
Reading: | Mon: Wed: Fri: Project: Presentation | |
May 02 | ||||
May 04 |