Intro to Scikit-Learn

Research Triangle Analysts
January 2014
Presented by Tim Hopper

Abstract: Scikit-learn is an actively developed Python package providing implementations of many machine learning algorithms (e.g. SVM, kNN, linear models, HMM, k-Means, spectral clustering). However, the benefits of Scikit-learn go well beyond carefully implemented learning algorithms. Being built in Python, it integrates easily with countless other Python modules for tasks such as plotting, data munging, and application development. Its consistent API across algorithms allows for rapid experimentation with multiple learning methods. Scikit-learn is also well documented and provides many examples.
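As a rough illustration of the consistent API mentioned above, the sketch below evaluates three different classifiers with identical code. The dataset (iris), the choice of models, and their parameters are assumptions made for this example and are not taken from the talk; the imports assume a recent scikit-learn release.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

# Illustrative dataset; any (X, y) pair of arrays would work the same way.
X, y = load_iris(return_X_y=True)

# Every estimator exposes the same fit/predict/score interface,
# so swapping algorithms only means swapping the object.
models = {
    "SVM": SVC(),
    "kNN": KNeighborsClassifier(n_neighbors=5),
    "logistic regression": LogisticRegression(max_iter=1000),
}

scores = {}
for name, model in models.items():
    # 5-fold cross-validated accuracy, averaged over the folds.
    scores[name] = cross_val_score(model, X, y, cv=5).mean()
    print(f"{name}: {scores[name]:.3f}")
```

Because the loop body never refers to a specific algorithm, adding another method to compare is a one-line change to the `models` dictionary.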

Instead of discussing particular machine learning algorithms provided by the package, I will focus on Scikit-learn and Python as a toolkit for solving data problems from start to finish. I will emphasize the Pipeline tool, which lets the user chain together all the steps of a machine learning workflow, including preprocessing, dimensionality reduction, feature selection, and model fitting.
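A minimal sketch of such a chain follows, with one step for each stage named above. The dataset, the specific transformers, the step names, and all parameter values are assumptions chosen for illustration, not the configuration used in the talk; the imports assume a recent scikit-learn release.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Illustrative data, split so the model is scored on rows it never saw.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Each (name, estimator) pair is one step; the step names are arbitrary labels.
pipeline = Pipeline([
    ("scale", StandardScaler()),                   # preprocessing
    ("reduce", PCA(n_components=3)),               # dimensionality reduction
    ("select", SelectKBest(f_classif, k=2)),       # feature selection
    ("model", LogisticRegression(max_iter=1000)),  # model fitting
])

# fit() runs fit_transform on every intermediate step, then fits the final model.
pipeline.fit(X_train, y_train)

# score() applies the already-fitted transforms to the test data before scoring.
accuracy = pipeline.score(X_test, y_test)
print(f"held-out accuracy: {accuracy:.2f}")
```

Wrapping the steps in one object means the scaler, PCA, and feature selector are fitted on the training rows only, which avoids leaking information from the test set into the preprocessing.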

A (poor quality) video of this talk is here.

The slides for this presentation are generated from Intro to Scikit-Learn.ipynb.

To view the slides in a browser, run the following command:

ipython nbconvert Intro\ to\ Scikit-Learn.ipynb --to slides --post serve
