https://video.seas.harvard.edu/media/14_01_31+Bo+Peng--+IACS/1_ctmgbp2j
===========
Courtesy of Introduction to Data Science
http://datascienc.es/schedule/
http://berkeleydatascience.files.wordpress.com/2012/01/20120117berkeley1.pdf
- Identify problem
- Instrument data sources
- Collect data
- Prepare data (integrate, transform, clean, impute, filter, aggregate)
- Build model
- Evaluate model
- Communicate results
- Inspection
- Error checking
- Modification
- Comparison
- Modeling and model fitting
- Simulation
- What-if analyses
- Interpretation
- Presentation of conclusions
- Acquire
- Parse
- Filter
- Mine
- Represent
- Refine
- Interact
- Obtain
- Scrub
- Explore
- Model (build a Model) - Write an equation or code that describes the process just based on the data
- Interpret
"Today's approach to problem solving"
- Motivation (understand the problem from human point of view
- Look at realistic data (coming from real apis and web applications)
- Explore the potential solutions (different algorithmic approaches)
- Make the solution work
- Identify data to collect and its relevance to your problem
- Statistical specification of the problem
- Method selection
- Analysis of method
- Interpret results for non-statisticians
- Capture
- Curate
- Communicate
- Assemble an accurate and relevant data set
- Choose the appropriate algorithm