Aggregating numerical features
A good use case for taking an aggregate statistic to create a new feature is when you have many features with similar, related values. Here, you have a DataFrame of running times named running_times_5k. For each name in the dataset, take the mean of their 5 run times.
This exercise is part of the course
Preprocessing for Machine Learning in Python
Exercise instructions
- Use the .loc[] method to select all rows and the run-time columns, then take the .mean() across those columns so that each row gets its own average.
- Print the .head() of the DataFrame to see the mean column.
Hands-on interactive exercise
Try this exercise by completing the sample code below.
# Use .loc to create a mean column
running_times_5k["mean"] = ____.loc[____, ____].____(axis=____)
# Take a look at the results
print(____)
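
To make the row-wise aggregation concrete, here is a minimal sketch on a made-up stand-in for running_times_5k. The runner names, the times, and the column labels run1 through run5 are all assumptions for illustration; the real dataset's labels may differ. It shows the general pattern: slice the numeric columns with .loc, then average with axis=1 so each row gets one value.

```python
import pandas as pd

# Hypothetical stand-in for running_times_5k: names, times, and the
# column labels run1..run5 are invented for this illustration.
running_times_5k = pd.DataFrame({
    "name": ["Sue", "Mark", "Sean"],
    "run1": [20.1, 16.5, 23.5],
    "run2": [18.5, 17.1, 25.1],
    "run3": [19.6, 16.9, 25.2],
    "run4": [20.3, 17.6, 24.6],
    "run5": [18.3, 17.3, 23.9],
})

# ":" selects every row; the label slice "run1":"run5" selects the five
# run columns (label slices in .loc include both endpoints).
# axis=1 averages across the columns, giving one mean per row.
running_times_5k["mean"] = running_times_5k.loc[:, "run1":"run5"].mean(axis=1)

# Show the first rows, including the new mean column
print(running_times_5k.head())
```

Slicing by label keeps the non-numeric name column out of the calculation. With axis=0, the same call would instead return one average per run column rather than one per runner.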