Filtering summarized output
In the last exercise, you may have noticed that the country that voted least frequently, Zanzibar, had only 2 votes in the entire dataset. You certainly can't draw any substantial conclusions from so little data!
Typically in a progressive analysis, when you find that a few of your observations have very little data while others have plenty, you set some threshold to filter them out.
This exercise is part of the course
Case Study: Exploratory Data Analysis in R
Exercise instructions
Use filter() to remove countries that have fewer than 100 votes from the sorted data.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Filter out countries with fewer than 100 votes
by_country %>%
  arrange(percent_yes)
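One possible way to complete the exercise is sketched below. It assumes that by_country stores each country's vote count in a column named total; that name does not appear in the text above, so check your own data frame and adjust it if needed. The toy by_country built here is invented purely so the snippet runs on its own.

```r
library(dplyr)

# Toy stand-in for by_country; the values are made up for illustration.
# The column name `total` (number of votes per country) is an assumption.
by_country <- data.frame(
  country = c("A", "B", "C", "D"),
  total = c(2, 150, 99, 100),
  percent_yes = c(0.0, 0.8, 0.5, 0.3),
  stringsAsFactors = FALSE
)

# Sort by percent_yes, then keep only countries with at least 100 votes
filtered <- by_country %>%
  arrange(percent_yes) %>%
  filter(total >= 100)

print(filtered)
```

Because the instruction is to drop countries with fewer than 100 votes, the condition is total >= 100, so a country with exactly 100 votes is kept.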