This repository contains resources and cheatsheets that will be helpful for the course (and after it!).
Switch branches/tags
Nothing to show
Clone or download
Latest commit 62a4098 Feb 15, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Cheat-Sheets fix pandas cheat sheet path Feb 12, 2019
Data-Analysis.md Add Pandas practice Jan 18, 2019
Data-Visualization.md Add AutoViz Feb 6, 2019
Datasets.md Fix datasets link Feb 4, 2019
Enviroment.md update dev enviornment tutorials Feb 13, 2019
FAQ.md remove gap between lines Feb 13, 2019
Machine-Learning.md Update Machine-Learning.md Nov 4, 2018
NLP.md Add nlp datasets Oct 5, 2018
Python.md Reorganize Python resources Feb 5, 2019
README.md Add FAQ Feb 4, 2019
Statistics.md fix link for cs 229 Feb 15, 2019

README.md

Data Science Resources

This repository contains resources and cheatsheets that will be helpful for the course (and after it!).

  • Cheat Sheets - a lot of useful cheat sheets for Python, data analysis, machine learning, Git and more.
  • Frequently Asked Questions - a list of links (mostly to Stack Overflow) of common questions and problems relevant to the course.
  • Development Environment - development environment resources: Command Line, Git, Jupyter Notebook, etc.
  • Python - basic and intermediate Python guides, tips and resources.
  • Data Analysis - data analysis resources: Pandas and NumPy libraries, Exploratory Data Analysis (EDA), etc.
  • Data Visualization - various data visualization guides - Pandas plotting, Matplotlib, Seaborn, Bokeh.
  • Machine Learning - different machine learning guides: supervised learning (regression, classification, tree-based models etc), unsupervised learning (clustering), feature selection, model evaluation, etc.
  • Natural Language Processing - Natural Language Processing resources: NLTK, SKLearn NLP and more.
  • Statistics and Math - mostly optional reading for students who would like to deepen their understanding of statistics and math.
  • Datasets - links to interesting datasets.