The prerequisites for this course are Calculus I, Data Structures, and Discrete Mathematics (this last is a co-req).
This course will introduce core principles of learning from data. More and more decisions are being made by algorithms that operate on large datasets, and this course will give students the tools to understand and contribute to this process. Throughout we will emphasize the ethical use of data and analyze case studies of how data science has intersected with society. This course will have a significant theory component, covering introductory linear algebra, probability, statistics, modeling, information theory, and optimization. However, we will also implement these ideas (in Python) and apply them to concrete datasets from a variety of fields (including images, video, text, DNA, music, art, etc).
The language for this course is Python 3.
See the Schedule for each week's reading assignment.
The schedule is tentative and subject to change throughout the semester.
WEEK | DAY | ANNOUNCEMENTS | TOPIC & READING | LABS |
1 | Sep 05 | Introduction to Data Science and Python
Reading:
| Tues: Thurs: | |
Sep 07 | ||||
2 | Sep 12 | Introduction to Modeling
Reading:
| Tues: Thurs: Lab 2: Modeling climate change | |
Sep 14 | ||||
3 | Sep 19 | Applied Linear Algebra and Optimization
Reading:
| Tues: Thurs: Lab 3: Gradient descent | |
Sep 21 | Last day to drop (Sep 22) | |||
4 | Sep 26 | Evaluation Metrics
Reading:
| Tues: Thurs: Lab 4: Evaluation Metrics | |
Sep 28 | ||||
5 | Oct 03 | Ethics: Disparate Impact (+ review)
Reading:
| Tues: Thurs: Midterm 1 | |
Oct 05 | ||||
6 | Oct 10 | Probabilistic modeling I
Reading:
| Tues: Thurs: Lab 5: Naive Bayes | |
Oct 12 | ||||
Oct 17 | Fall Break | |||
Oct 19 | ||||
7 | Oct 24 | Probabilistic modeling II
Reading:
| Tues: Thurs: Lab 6: Information Theory | |
Oct 26 | ||||
8 | Oct 31 | Information theory
Reading:
| Tues: Thurs: Lab 7: Logistic Regression and Visualization + Project Proposal | |
Nov 02 | ||||
9 | Nov 07 | Visualization
Reading:
| Tues: Thurs: Lab 8: Statistics and Visualization | |
Nov 09 | ||||
10 | Nov 14 | Introduction to statistics
Reading: | Tues: Thurs: Midterm 2 | |
Nov 16 | ||||
11 | Nov 21 | Midterm II review
| Tues: | |
Nov 23 | Thanksgiving (no class) | |||
12 | Nov 28 | Unsupervised learning
Reading:
| Tues: Thurs: | |
Nov 30 | ||||
13 | Dec 05 | Intro to neural networks
Reading:
| Tues: | |
Dec 07 | ||||
14 | Dec 12 | Project Presentations
| ||
Dec 14 | Last day to pass/fail (Dec 15) |