Lab: EDA and Feature Engineering in Pandas
Pandas: Final Practice Problems
In this homework, you're going to write code for a few problems on two datasets:
- The iris dataset - measurements of iris flowers (sepal and petal length and width) that are used to classify each flower's species
- The NCAA March Madness dataset - a collection of ranks for teams in the March Madness sportsball competition.
You'll practice the following programming concepts we've covered in class:
- Basic EDA with Pandas
- Using the `.apply()` method to create new feature columns and mutate existing columns
- Broadcasting, or implementing math transformations at column scale
- Dropping columns
- And much, much more!
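
As a quick refresher before the problems, here is a minimal sketch of the concepts listed above. It runs on a tiny hand-made DataFrame rather than on either homework dataset: the values and the column names (`sepal_length`, `sepal_width`, `species`) are assumptions made up for illustration and may not match the files you are given.

```python
import pandas as pd

# Toy stand-in for a few iris-like rows. Values and column names are
# illustrative assumptions, not taken from the actual homework data.
df = pd.DataFrame({
    "sepal_length": [5.0, 6.0, 7.0],
    "sepal_width": [3.0, 3.0, 3.5],
    "species": ["setosa", "versicolor", "virginica"],
})

# Basic EDA: size, column types, and summary statistics.
print(df.shape)
print(df.dtypes)
print(df.describe())

# .apply() to create a new feature column from an existing one.
df["is_long"] = df["sepal_length"].apply(lambda x: x > 5.5)

# .apply() to mutate an existing column in place (capitalize species names).
df["species"] = df["species"].apply(str.title)

# Broadcasting: arithmetic on whole columns at once, no explicit loop.
df["sepal_area"] = df["sepal_length"] * df["sepal_width"]
df["sepal_length_mm"] = df["sepal_length"] * 10

# Dropping a column that is no longer needed.
df = df.drop(columns=["sepal_width"])

print(df)
```

Broadcasting is usually preferable to `.apply()` for plain arithmetic because it operates on the whole column at once. `.apply()` is most useful when the per-value logic cannot be written as a simple column expression.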