Exercise

Grouped summaries

So there are more non-complaints than complaints in twitter_data. You might be starting to question whether or not this data is actually from Twitter! There are a few other columns of interest in twitter_data that would be helpful to explore before you get to the tweets themselves. Every tweet includes the number of followers its author has, stored in the usr_followers_count column. Do you expect those who complain to have more followers or fewer followers, on average, than those who don't complain? You can use grouped summaries to answer this quickly.

Instructions

  • Group the data by complaint_label.
  • Compute the average, minimum, and maximum of usr_followers_count.
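
To make the idea of a grouped summary concrete, here is a minimal sketch in plain Python. It is only an illustration, not the exercise solution: the rows, the label values, and the helper name grouped_summary are all made up for this example, and only the column names complaint_label and usr_followers_count come from the exercise.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical rows standing in for twitter_data; labels and counts are made up.
rows = [
    {"complaint_label": "Complaint", "usr_followers_count": 120},
    {"complaint_label": "Complaint", "usr_followers_count": 80},
    {"complaint_label": "Non-Complaint", "usr_followers_count": 300},
    {"complaint_label": "Non-Complaint", "usr_followers_count": 50},
    {"complaint_label": "Non-Complaint", "usr_followers_count": 10},
]


def grouped_summary(rows, group_col, value_col):
    """Group rows by group_col, then summarise value_col within each group."""
    # Step 1: collect the values of value_col under each group label.
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_col]].append(row[value_col])

    # Step 2: reduce each group's values to average, minimum, and maximum.
    return {
        label: {"avg": mean(values), "min": min(values), "max": max(values)}
        for label, values in groups.items()
    }


summary = grouped_summary(rows, "complaint_label", "usr_followers_count")
for label, stats in summary.items():
    print(label, stats)
```

The two steps mirror the two instructions: first split the data by label, then compute the three statistics within each group, so the result has one row of statistics per label.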