The prerequisite for this course is CS260 Foundations of Data Science. There are no biology prerequisites for this course. The goal of this course is to introduce foundational algorithms that have become essential for learning from biological data. With the genome sequencing revolution of the last 20 years, it has become easier and cheaper to obtain genetic data, but often overwhelming to store, analyze, and make sense of this data. These issues have both driven new algorithm development and repurposed existing algorithms for biology.
We will study both types of algorithms, with a focus on the scientific method. By the end of this course, you should be able to ask a biological question, form a hypothesis about the answer, design a computational experiment to test your hypothesis, implement and execute the experiment, iterate your design and implementation based on the results, and finally interpret and visualize the results to form a biologically relevant conclusion. We will focus on synthetic and publicly available datasets, not generating new data.
The language for this course is Python 3.
WEEK | DAY | ANNOUNCEMENTS | TOPIC & READING | LABS |
1 | Sep 03 | Introduction to Bioinformatics and Molecular Biology
Reading:
| Tues: Thurs: | |
Sep 05 | ||||
2 | Sep 10 | BWT and Read Mapping
Reading:
| Tues: Thurs: Lab 1: String search | |
Sep 12 | ||||
3 | Sep 17 | Genome Assembly
Reading:
| Tues: Thurs: Lab 2: BWT and read mapping | |
Sep 19 | Drop ends (Sep 20) | |||
4 | Sep 24 | Pairwise Sequence Alignment
Reading:
| Tues: Thurs: Lab 3: Genome assembly | |
Sep 26 | ||||
5 | Oct 01 | Phylogenetic Trees 1
Reading:
| Tues: Thurs: Lab 4: Pairwise sequence alignment | |
Oct 03 | ||||
6 | Oct 08 | Midterm Review
| Tues: Thurs:
In-class Midterm 1 | |
Oct 10 | ||||
Oct 15 | Fall Break | |||
Oct 17 | ||||
7 | Oct 22 | Phylogenetic Trees 2
Reading
| ||
Oct 24 | ||||
8 | Oct 29 | Ancestral State Reconstruction
Reading
|
Lab 5: Phylogenetic trees | |
Oct 31 | ||||
9 | Nov 05 | Population Genetics
Reading
|
Lab 6: Perfect phylogeny | |
Nov 07 | ||||
10 | Nov 12 | Hidden Markov Models 1
Reading
|
Lab 7: Population genetics | |
Nov 14 | ||||
11 | Nov 19 | Hidden Markov Models 2
Reading
| Lab 8: Hidden Markov Models | |
Nov 21 | ||||
12 | Nov 26 | Visualizing Genomes
| In-class midterm 2 | |
Nov 28 | Thanksgiving (no class) | |||
13 | Dec 03 | Deep Learning in Genomics
| Final Project | |
Dec 05 | ||||
14 | Dec 10 | Ethics and the Genome + Project Presentations
|
| |
Dec 12 |