The prerequisites for this course are Calculus I, Data Structures, and Discrete Mathematics (this last is a co-req).
This course will introduce core principles of learning from data. More and more decisions are being made by algorithms that operate on large datasets, and this course will give students the tools to understand and contribute to this process. Throughout we will emphasize the ethical use of data and analyze case studies of how data science has intersected with society. This course will have a significant theory component, covering introductory linear algebra, probability, statistics, modeling, information theory, and optimization. However, we will also implement these ideas (in Python) and apply them to concrete datasets from a variety of fields (including images, video, text, DNA, music, art, etc).
The language for this course is Python 3.
See the Schedule for each week's reading assignment.
The schedule is tentative and subject to change throughout the semester.
WEEK | DAY | ANNOUNCEMENTS | TOPIC & READING | LABS |
1 | Aug 31 | Introduction to Data Science and Python
Reading:
| Tues: Thurs: | |
Sep 02 | ||||
2 | Sep 07 | Introduction to Modeling
Reading:
| Tues: Thurs: Lab 2: Modeling climate change | |
Sep 09 | ||||
3 | Sep 14 | Applied Linear Algebra and Optimization
Reading:
| Tues: Thurs: Lab 3: Gradient descent | |
Sep 16 | Last day to drop (Sep 17) | |||
4 | Sep 21 | Evaluation Metrics
Reading:
| Tues: Thurs: Lab 4: Evaluation Metrics | |
Sep 23 | ||||
5 | Sep 28 | Ethics: Disparate Impact (+ review)
Reading:
| Tues: Thurs: Midterm 1 | |
Sep 30 | ||||
6 | Oct 05 | Probabilistic modeling I
Reading:
| Tues: Thurs: Lab 5: Naive Bayes | |
Oct 07 | Last day to pass/fail (Oct 08) | |||
Oct 12 | Fall Break | |||
Oct 14 | ||||
7 | Oct 19 | Probabilistic modeling II
Reading:
| Tues: Thurs: Lab 5: (cont) | |
Oct 21 | ||||
8 | Oct 26 | Information theory
Reading:
| Tues: Thurs: Lab 6: Information Theory | |
Oct 28 | ||||
9 | Nov 02 | Visualization
Reading: | Tues: Thurs: Lab 7: Visualization + Project Proposal | |
Nov 04 | ||||
10 | Nov 09 | Introduction to statistics
Reading: | Tues: Thurs: Lab 8: Statistics | |
Nov 11 | ||||
11 | Nov 16 | Midterm II review
| Tues: Midterm 2 | |
Nov 18 | ||||
12 | Nov 23 | Unsupervised learning
Reading: | Tues: | |
Nov 25 | Thanksgiving (no class) | |||
13 | Nov 30 | Intro to neural networks
Reading: | ||
Dec 02 | ||||
14 | Dec 07 | Project Presentations
| ||
Dec 09 |