The prerequisites for this course are Calculus I, Data Structures, and Discrete Mathematics (this last is a co-req).
This course will introduce core principles of learning from data. More and more decisions are being made by algorithms that operate on large datasets, and this course will give students the tools to understand and contribute to this process. Throughout we will emphasize the ethical use of data and analyze case studies of how data science has intersected with society. This course will have a significant theory component, covering introductory linear algebra, probability, statistics, modeling, information theory, and optimization. However, we will also implement these ideas (in Python) and apply them to concrete datasets from a variety of fields (including images, video, text, DNA, music, art, etc).
The language for this course is Python 3.
See the Schedule for each week's reading assignment.
The schedule is tentative and subject to change throughout the semester.
WEEK | DAY | ANNOUNCEMENTS | TOPIC & READING | LABS |
1 | Jan 21 | Introduction to Data Science and Python
Reading:
| Tues: Thurs (virtual asynchronous class): | |
Jan 23 | ||||
2 | Jan 28 | Introduction to Modeling
Reading:
| Tues: Thurs: Lab 1: Computing and plotting in Python | |
Jan 30 | ||||
3 | Feb 04 | Applied Linear Algebra and Optimization
Reading:
| Tues: Thurs: Lab 2: Modeling climate change | |
Feb 06 | Last day to drop (Feb 07) | |||
4 | Feb 11 | Evaluation Metrics
Reading:
| Tues: Thurs: Lab 3: Gradient descent | |
Feb 13 | ||||
5 | Feb 18 | Ethics: Disparate Impact (+ review)
Reading:
| Tues: Thurs: Lab 4: Evaluation Metrics | |
Feb 20 | ||||
6 | Feb 25 | Probabilistic modeling I
Reading:
| Tues: Thurs: | |
Feb 27 | ||||
7 | Mar 04 | Probabilistic modeling II
Reading:
| Midterm 1 | |
Mar 06 | ||||
Mar 11 | Spring Break | |||
Mar 13 | ||||
8 | Mar 18 | Information theory
Reading:
| Tues: Thurs: Lab 5: Naive Bayes | |
Mar 20 | ||||
9 | Mar 25 | Visualization
Reading:
| Tues: Thurs: Lab 6: Information Theory | |
Mar 27 | ||||
10 | Apr 01 | Introduction to statistics
Reading: | Tues: Thurs: Lab 7: Logistic Regression and Visualization + Project Proposal | |
Apr 03 | ||||
11 | Apr 08 | Midterm II review
| Tues: Thurs: Lab 8: Statistics and Visualization | |
Apr 10 | ||||
12 | Apr 15 | Unsupervised learning
Reading:
| Tues: Midterm 2 | |
Apr 17 | ||||
13 | Apr 22 | Intro to neural networks
Reading:
| ||
Apr 24 | ||||
14 | Apr 29 | Project Presentations
| ||
May 01 | Last day to pass/fail (May 02) |