- Your Team
- Course Overview
- Course Schedule
- Tech Requirements
- Classroom Tools: Slack
- Student Expectations
- Office Hours
- Student Feedback
In our part-time course, we will use Python to explore datasets, build predictive models, and communicate data driven insights.
Specifically, you will learn to:
- Define the language and approaches used by data scientists to solve real world problems.
- Perform exploratory data analysis with powerful programmatic tools, including the command line, python, and pandas.
- Build and refine basic machine learning models to predict patterns from data sets.
- Communicate data driven insights to peers and stakeholders in order to inform business decisions.
Course materials provided for this curriculum use Python 3.6; however, you can still access legacy Python 2.7 versions by checking out the
python2 branch for any given PT DS repository.
Your Instructional Team
Instructor: David Yakobovitch
Assistant: Chris Messier
General Assembly's Data Science part time materials are organized into four units.
|Unit 1||Foundations||Python Syntax, Development Environment||Lessons 1-4|
|Unit 2||Working with Data||Stats Review, Visualization, & EDA||Lessons 5-9|
|Unit 3||Data Modeling||Regression, Classification, & KNN||Lessons 10-14|
|Unit 4||Applications||Decision Trees, NLP, Common Models||Lessons 15-19|
Here is the schedule we will be following for our part time data science course:
|Lesson||Unit Number||Session Number|
|What is Data Science?||Unit 1||Session 1|
|Your Development Environment||Unit 1||Session 2|
|Python Foundations||Unit 1||Session 3|
|Review + Project Workshop||Unit 1||Session 4|
|Statistics Review||Unit 2||Session 5|
|Experiments & Hypothesis Testing||Unit 2||Session 6|
|Exploratory Data Analysis||Unit 2||Session 7|
|Data Visualization in Python||Unit 2||Session 8|
|Review + Project Workshop||Unit 2||Session 9|
|Linear Regression||Unit 3||Session 10|
|Train-Test Split & Bias-Variance||Unit 3||Session 11|
|kNN / Classification||Unit 3||Session 12|
|Logistic Regression||Unit 3||Session 13|
|Review + Project Workshop||Unit 3||Session 14|
|Decision Trees||Unit 4||Session 15|
|Natural Language Processing||Unit 4||Session 16|
|Clustering||Unit 4||Session 17|
|Getting Data from APIs||Unit 4||Session 18|
|Review + Project Workshop||Unit 4||Session 19|
|Project Presentations||Unit 4||Session 20|
This course will ask you to complete two sets of projects: short unit projects and a longer final project.
At the end of each unit, we'll ask you to complete a small project. These enrichment projects require you to synthesize the skills learned in that unit. There are three unit projects.
Note: Unit projects 1 and 2 are required, whereas Unit 3 project is optional... but strongly encouraged!
You'll also complete a longer final project, which asks you to apply your skills to a real world problem. At the end of the course, you'll be asked to share your final project with peers and colleagues.
The final project is broken down into five smaller deliverables, which walks you through every step of the data science workflow as you tackle a real world project.
- Project 1: Python Technical Code Challenges
- Project 2: EDA + Chipotle
- Project 3: Linear Regression and KNN Practice (Optional)
- Project 4: Final Project
- Part 1: Create Proposal
- Part 2: Identify Dataset
- Part 3: Perform EDA
- Part 4: Model Data
- Part 5: Present Findings
|Unit 1, Lesson 1||Review All Projects and Deliverables||Discussion|
|Unit 1, Lesson 4||Unit Project 1||Complete in-Class or as HW|
|Unit 2, Lesson 2||Review Final Project Datasets||Discussion or as HW|
|Unit 2, Lesson 5||Unit Project 2||Complete in-Class or as HW|
|Unit 2, Lesson 5||Final Project Pt 1: Create Problem statement||Assigned|
|Unit 3, Lesson 1||Final Project Pt 1: Create Problem statement||Due|
|Unit 3, Lesson 1||Final Project Pt 2: Define Data sources||Assigned|
|Unit 3, Lesson 3||Final Project Pt 2: Define Data sources||Due|
|Unit 3, Lesson 3||Final Project Pt 3: Perform EDA on Data||Assigned|
|Unit 3, Lesson 5||Unit Project 3||Optional: Complete In-Class or as HW|
|Unit 4, Lesson 1||Final Project Pt 3: Perform EDA on Data||Due|
|Unit 4, Lesson 1||Final Project Pt 4: Model Data||Assigned|
|Unit 4, Lesson 4||Final Project Pt 4: Model Data||Due|
|Unit 4, Lesson 4||Final Project Pt 5: Present Data||Assigned|
|Unit 4, Lesson 6||Final Project Pt 5: Present Data||Due|
Recommended Technology Requirements
- 8GB Ram (at least)
- 10GB Free Hard Drive Space (after installing Anaconda)
- Download and Install Anaconda with Python 3.6.
Note: Anaconda provides support for two different versions of Python. We'll primarily use Python 3.6 in this course.
- Install Git Bash
- Google Chome
- Text editor; we recommend Atom
We'll be using Slack for our in-class communications. Slack is a messaging platform where you can chat with your peers and instructors. We will use Slack to share information about the course, discuss lessons, and submit projects. Our Slack homepage is dsfundamentals.slack.com .
Pro Tip: If you've never used Slack before, check out these resources:
[Add specific local market attendance, student policy, and parking expectations here]
Every week, your instructional team will hold office hours where you can get in touch to ask questions about anything relating to the course. This is a great opportunity to follow up on questions or ask for more details about any topics covered so far.
- Tuesdays and Thursdays, before class (or by Appointment)
- Monday and Wednesday evenings by Slack
Slack us or post in our #officehours channel to reserve a time-slot!
Throughout the course, you'll be asked to provide feedback about your experience. This feedback is extremely important, as it helps us provide you with a better learning experience.
[Insert specific VTS/Exit Ticket details here]