Are these findings generalizable?
Let's look at another sample to see if it is representative of the population. This time, you'll look at the duration_minutes
column of the Spotify dataset, which contains the length of the song in minutes.
spotify_population
and spotify_mysterious_sample2
are available; dplyr
and ggplot2
are loaded.
This exercise is part of the course
Sampling in R
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Visualize the distribution of duration_minutes as a histogram with a binwidth of 0.5
___