1. Learn
  2. /
  3. Courses
  4. /
  5. Preprocessing for Machine Learning in Python

Connected

Exercise

Aggregating numerical features

A good use case for taking an aggregate statistic to create a new feature is when you have many features with similar, related values. Here, you have a DataFrame of running times named running_times_5k. For each name in the dataset, take the mean of their 5 run times.

Instructions

100 XP
  • Use the .loc[] method to select all rows and columns to find the .mean() of the each columns.
  • Print the .head() of the DataFrame to see the mean column.