How sparse is my data?

Most datasets contain missing values, often represented as NaN (Not a Number). If you are working with pandas, you can easily check how many missing values exist in each column.
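
As a minimal sketch of that per-column check, the snippet below counts missing values in a small made-up DataFrame. The name toy_df and its contents are assumptions for illustration only; they are not the survey data used in this exercise.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data, NOT the survey DataFrame from the exercise
toy_df = pd.DataFrame({
    "Age": [25, np.nan, 31, np.nan],
    "Gender": ["Male", "Female", None, "Female"],
})

# isna() marks each missing cell as True; sum() then counts the True values per column
missing_per_column = toy_df.isna().sum()
print(missing_per_column)
```

Both NaN and None are treated as missing by isna(), so the result is one count per column.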

Let's find out how many of the developers taking the survey chose to enter their age (found in the Age column of so_survey_df) and their gender (Gender column of so_survey_df).

This exercise is part of the course

Feature Engineering for Machine Learning in Python

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Subset the DataFrame
sub_df = ____

# Print the number of non-missing values
print(sub_df.____)
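
The same two steps, subsetting columns and counting non-missing values, can be sketched on a made-up DataFrame. The names toy_df and sub_toy_df and the data are hypothetical stand-ins for so_survey_df, and count() is only one way to get these numbers; the blanks above may expect a different method.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the survey data; the real so_survey_df has other contents
toy_df = pd.DataFrame({
    "Age": [25, np.nan, 31, np.nan],
    "Gender": ["Male", "Female", None, "Female"],
    "Country": ["USA", "India", "UK", "Germany"],
})

# Select several columns at once by passing a list of column names
sub_toy_df = toy_df[["Age", "Gender"]]

# count() returns the number of non-missing values in each column
non_missing = sub_toy_df.count()
print(non_missing)
```

Comparing each count with len(sub_toy_df) shows how sparse a column is.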