Lab: EDA and Feature Engineering in Pandas


Pandas: Final Practice Problems

In this homework, you're going to write code for a few problems on two datasets:

  • The iris dataset - measurements of iris flowers (sepal and petal sizes), each labeled with its species
  • The NCAA March Madness dataset - a collection of team rankings for the March Madness sportsball competition

You'll practice the following programming concepts we've covered in class:

  • Basic EDA with Pandas
  • Using the .apply() method to create new feature columns and mutate existing columns
  • Broadcasting, i.e. applying math transformations to entire columns at once
  • Dropping columns
  • And much, much more!
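
As a quick refresher, here is a minimal sketch of the concepts listed above. The DataFrame is a tiny hand-made stand-in, and its column names (sepal_length, sepal_width, species) are assumptions for illustration only; the actual lab datasets may use different names.

```python
import pandas as pd

# Hypothetical toy data; column names are assumptions, not the lab's actual schema.
df = pd.DataFrame({
    "sepal_length": [5.1, 7.0, 6.3],
    "sepal_width": [3.5, 3.2, 3.3],
    "species": ["setosa", "versicolor", "virginica"],
})

# Basic EDA: dimensions, column types, and summary statistics.
print(df.shape)
print(df.dtypes)
print(df.describe())

# .apply() to create a new feature column from an existing one.
df["name_length"] = df["species"].apply(len)

# .apply() to mutate an existing column in place.
df["species"] = df["species"].apply(lambda s: s.capitalize())

# Broadcasting: arithmetic runs over whole columns at once, no loop needed.
df["sepal_area"] = df["sepal_length"] * df["sepal_width"]
df["sepal_length_mm"] = df["sepal_length"] * 10

# Dropping a column; drop() returns a new DataFrame, so reassign it.
df = df.drop(columns=["sepal_width"])
print(df.columns.tolist())
```

Note that .apply() calls a Python function once per value, while broadcasting is vectorized, so broadcasting is usually the better choice whenever the transformation is plain arithmetic.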