| Instructor: | Alessandro Gagliardi |
| ADFGagliardi+GA@Gmail.com | |
| EiRs: | Drew Stevens, dss9@columbia.edu |
| Nir Kaldero, nirkaldero@gmail.com | |
| Classes: | 6:30pm-9:30pm, Mondays and Wednesdays |
| April 14 – June 30, 2014 (no class May 26) | |
| Office Hours: | TBD |
Fields of Research: Industrial Organizations, Marketing (strategy) , Microeconomics, finance.
From a Taxonomy of Data Science (by Dataists)
A. Obtain
B. Scrub
C. Explore
D. Model
E. Interpret
A. Collect data around user retention, user actions within the product, potentially find data outside of company
B. Extract aggregated values from raw data
C. Examine data to find common distributions and correlations
D. Extract new meaning to predict if user would purchase again
E. Share results (and probably also go back to the drawing board)
1974 - Peter Naur published Concise Survey of Computer Methods:
A basic principle of data science is this: The data representation must be chosen with due regard to the transformation to be achieved and the data processing tools available. This stresses the importance of concern for the characteristics of the data processing tools.
At the completion of this course, you will be able to:
| Date | Milestone |
|---|---|
| 4/23 | Preliminary Project Proposals Due (3-4 sentences) |
| 4/30 | EiR Feedback on Project Proposals Returned |
| 5/7 | Formal Proposals Due (including data and methods chosen) |
| 5/14 | EiR Feedback on Formal Proposals Returned |
| 5/14 | Midterm Assessment Due |
| 5/21 | Projects live on Github |
| 5/21 | Project Elevator Pitch in class (4 minutes each) |
| 5/28 | Peer Feedback Due |
| 6/4 | At least one working model |
| 6/11 | Final EiR Feedback Due |
| 6/23-25 | Final Presentations (12 minutes each) |