How sparse is my data?
Most datasets contain missing values, often represented as NaN (Not a Number). If you are working with Pandas you can easily check how many missing values exist in each column.
Let's find out how many of the developers taking the survey chose to enter their age (found in the Age
column of so_survey_df
) and their gender (Gender
column of so_survey_df
).
This exercise is part of the course
Feature Engineering for Machine Learning in Python
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Subset the DataFrame
sub_df = ____
# Print the number of non-missing values
print(sub_df.____)