Filtering summarized output

In the last exercise, you may have noticed that the country that voted least frequently, Zanzibar, had only 2 votes in the entire dataset. You certainly can't make any substantial conclusions based on that data!

Typically in a progressive analysis, when you find that a few of your observations have very little data while others have plenty, you set some threshold to filter them out.

This exercise is part of the course

Case Study: Exploratory Data Analysis in R

View Course

Exercise instructions

Use filter() to remove from the sorted data countries that have fewer than 100 votes.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Filter out countries with fewer than 100 votes
by_country %>%
  arrange(percent_yes)