Case studies and lecture notes/materials for DS 501 Spring 2016. View the course syllabus for more information.
Case One: Collecting data from Twitter
Using the Twitter REST and Streaming APIs to collect and explore data.
Case Two: Analyzing data from MovieLens
Using Pandas to analyze numerical movie ratings and demographic information from the MovieLens 1M Data Set.
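As a rough illustration of the kind of analysis this case involves, the sketch below loads a few made-up rows laid out like the `::`-delimited MovieLens 1M files, joins them, and computes the mean rating per title and gender. The sample rows, the column names, and the `load` helper are assumptions for illustration only; they are not taken from the course materials.

```python
import io

import pandas as pd

# Hypothetical sample rows in the "::"-delimited layout of MovieLens 1M.
RATINGS = """1::10::5::978300760
1::20::3::978302109
2::10::4::978301968
2::20::2::978300275
3::10::3::978824291
"""
USERS = """1::F::1::10::48067
2::M::56::16::70072
3::M::25::15::55117
"""
MOVIES = """10::Toy Story (1995)::Animation|Children's|Comedy
20::Jumanji (1995)::Adventure|Children's|Fantasy
"""


def load(text, names):
    """Parse one "::"-delimited table (hypothetical helper)."""
    # A multi-character separator requires the Python parsing engine.
    return pd.read_csv(
        io.StringIO(text), sep="::", engine="python", header=None, names=names
    )


ratings = load(RATINGS, ["user_id", "movie_id", "rating", "timestamp"])
users = load(USERS, ["user_id", "gender", "age", "occupation", "zip"])
movies = load(MOVIES, ["movie_id", "title", "genres"])

# Join ratings with user demographics and movie titles.
data = ratings.merge(users, on="user_id").merge(movies, on="movie_id")

# Mean rating per title, split by gender.
mean_by_gender = data.pivot_table(
    values="rating", index="title", columns="gender", aggfunc="mean"
)
print(mean_by_gender)
```

With the real data set, the inline strings would be replaced by the paths to the downloaded `.dat` files.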
Case Three: Textual analysis of movie reviews
Using Scikit-Learn to perform text and sentiment analysis of 2,000 movie reviews.
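One common way to approach sentiment analysis with Scikit-Learn is a TF-IDF vectorizer feeding a linear classifier. The sketch below assumes that setup and uses a tiny made-up corpus in place of the 2,000 reviews; the choice of classifier, the labels, and the example texts are all assumptions, not the method used in the case study.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Made-up training reviews and labels, standing in for the real corpus.
reviews = [
    "a wonderful, moving film with great acting",
    "brilliant direction and a great story",
    "a dull, boring plot and terrible acting",
    "terrible pacing, a boring waste of time",
]
labels = ["pos", "pos", "neg", "neg"]

# Turn each review into TF-IDF features, then fit a linear classifier.
model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(reviews, labels)

# Classify two unseen (also made-up) reviews.
new_reviews = ["great acting and a wonderful story", "boring and terrible"]
predictions = model.predict(new_reviews)
print(list(zip(new_reviews, predictions)))
```

On a corpus this small the predictions are not meaningful; the point is only the shape of the pipeline.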
Case Four: Data Science Shark Tank
Using hacking, business, and math skills to develop an idea for a new business or startup built on data science technology.
Lecture Notes & Materials
- Introduction
- Data Gathering
- Data Storage
- Business Intelligence
- Basic Statistics, Probability, and Linear Algebra
- Machine Learning Part 1 and Midterm Review
- Midterm Exam
- Machine Learning Part 2
- Visualization
- Large-Scale Data Analysis
- Graph Data
- Graph Data and High Dimensional Data
- Deep Learning and Final Review
- Final Exam
Released under the MIT license. See LICENSE for details.