Website with various data science and machine learning resources
Project maintained by AdiBroHosted on GitHub Pages — Theme by mattgraham
Statistics Resources
General Statistics
Think Stats - Think Stats is an introduction to Probability and Statistics for Python programmers.
An Introduction to Statistical Learning - This book provides an introduction to statistical learning methods. Website contains a free PDF. Excellent resource!
An Introduction to Statistical Learning with Python Code - this repository contains Python code for a selection of tables, figures and LAB sections from the book ‘An Introduction to Statistical Learning with Applications in R’ by James, Witten, Hastie, Tibshirani (2013). The book was originally written with examples in R, Jordi Warmenhoven “translates” it into Python code.
The Probability and Statistics Cookbook - The probability and statistics cookbook is a succinct representation of various topics in probability theory and statistics. It provides a comprehensive mathematical reference reduced to its essence, rather than aiming for elaborate explanations. Source; GitHub Repo.
Statistical Rethinking with Python and PyMC3 -
Statistical Rethinking is an incredible good introductory book to Bayesian Statistics, its follows a Jaynesian and practical approach with very good examples and clear explanations. In this repository the codes were ported (originally in R and Stan) in the book to PyMC3.
An Introduction to Statistical Learning with Python Code - his repository contains Python code for a selection of tables, figures and LAB sections from the book ‘An Introduction to Statistical Learning with Applications in R’ by James, Witten, Hastie, Tibshirani (2013). The book was originally written with examples in R, Jordi Warmenhoven “translates” it into Python code.
Computer-age Statistical Inference - A 2016 book that covers various topics in statistical inference that are relevant in this data-science era, with scalable techniques applicable to large datasets. A free PDF can be downloaded from here.
K-means and Hierarchical Clustering - clustering tutorial from Andrew Moore’s CS class at Carnegie Mellon. (There are additional tutorials at https://www.autonlab.org.)