LoslegenKostenlos loslegen

Aggregating numerical features

A good use case for taking an aggregate statistic to create a new feature is when you have many features with similar, related values. Here, you have a DataFrame of running times named running_times_5k. For each name in the dataset, take the mean of their 5 run times.

Diese Übung ist Teil des Kurses

Preprocessing for Machine Learning in Python

Kurs anzeigen

Anleitung zur Übung

  • Use the .loc[] method to select all rows and columns to find the .mean() of the each columns.
  • Print the .head() of the DataFrame to see the mean column.

Interaktive Übung

Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.

# Use .loc to create a mean column
running_times_5k["mean"] = ____.loc[____, ____].____(axis=____)

# Take a look at the results
print(____)
Code bearbeiten und ausführen