Identifying inconsistency
In the video exercise, you learned about different kinds of inconsistencies that can occur within categories, making it look like a variable has more categories than it should.
In this exercise, you'll continue working with the sfo_survey dataset. You'll examine the dest_size column again, as well as the cleanliness column, and determine what kind of issues, if any, these two categorical variables face.
dplyr is loaded and sfo_survey is available.
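As a rough illustration of how counting can surface this kind of inconsistency, here is a minimal sketch on a small made-up data frame. The name toy_survey and every value in it are invented for this example and are not taken from sfo_survey; only the column names dest_size and cleanliness come from the exercise. Labels that differ only by stray whitespace or capitalization show up as separate rows in the counts.

```r
library(dplyr)

# Hypothetical toy data: values are invented for illustration only
# and do not come from the real sfo_survey dataset.
toy_survey <- data.frame(
  dest_size   = c("Small", "  Small  ", "Medium", "Medium ", "Large", "Hub"),
  cleanliness = c("Clean", "clean", "CLEAN", "Average", "average", "Dirty"),
  stringsAsFactors = FALSE
)

# Count each distinct label; near-duplicates appear as separate rows
size_counts <- toy_survey %>%
  count(dest_size)

clean_counts <- toy_survey %>%
  count(cleanliness)

print(size_counts)
print(clean_counts)

# Compare the number of distinct labels before and after a simple
# normalization to see which kind of inconsistency is present
raw_sizes     <- n_distinct(toy_survey$dest_size)
trimmed_sizes <- n_distinct(trimws(toy_survey$dest_size))
raw_clean     <- n_distinct(toy_survey$cleanliness)
lower_clean   <- n_distinct(tolower(toy_survey$cleanliness))
```

If trimming whitespace lowers the distinct count, the column likely has stray spaces; if lowercasing lowers it, the column likely has inconsistent capitalization.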
This exercise is part of the course Cleaning Data in R.
Have a go at this exercise by completing this sample code.
# Count dest_size
sfo_survey %>%
___