Dropping missing data
Now that you've explored the volunteer
dataset and understand its structure and contents, it's time to begin dropping missing values.
In this exercise, you'll drop both columns and rows to create a subset of the volunteer
dataset.
Diese Übung ist Teil des Kurses
Preprocessing for Machine Learning in Python
Anleitung zur Übung
- Drop the
Latitude
andLongitude
columns fromvolunteer
, storing asvolunteer_cols
. - Subset
volunteer_cols
by dropping rows containing missing values in thecategory_desc
, and store in a new variable calledvolunteer_subset
. - Take a look at the
.shape
attribute ofvolunteer_subset
, to verify it worked correctly.
Interaktive Übung
Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.
# Drop the Latitude and Longitude columns from volunteer
volunteer_cols = ____
# Drop rows with missing category_desc values from volunteer_cols
volunteer_subset = ____
# Print out the shape of the subset
print(____.____)