Set Boolean columns
Datasets may have columns that are most accurately modeled as Boolean values. However, pandas
usually loads these as floats by default, since defaulting to Booleans may have undesired effects like turning NA values into True
s.
fcc_survey_subset.xlsx
contains a string ID column and several True/False columns indicating financial stressors. You'll evaluate which non-ID columns have no NA values and therefore can be set as Boolean, then tell read_excel()
to load them as such with the dtype
argument.
pandas
is loaded as pd
.
This exercise is part of the course
Streamlined Data Ingestion with pandas
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the data
survey_data = pd.read_excel("fcc_survey_subset.xlsx")
# Count NA values in each column
print(survey_data.____)