Exercise

Airline tweets data

The twitter_data data frame has over 7,000 tweets about airlines. The tweets have already been classified as either complaints or non-complaints in the complaint_label column. Let's get a sense of how many of these tweets are complaints.

This is real data from Twitter, so it may contain profanity or other offensive content. The same applies to any later exercise that uses real Twitter data.

Instructions

  • Load the tidyverse package.
  • Get a sense of the size and content of the data by printing twitter_data.
  • Filter twitter_data so it's just the complaints. How many complaints are in the data?
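
The three steps above can be sketched as follows. This is only an illustration under stated assumptions: in the exercise itself twitter_data is already loaded, so the small stand-in tibble here is hypothetical, as are its tweet_id and tweet_text columns and the label values "Complaint" and "Non-Complaint". Check the printed data for the actual values in complaint_label before filtering.

```r
# Load the tidyverse (attaches dplyr, tibble, and related packages).
library(tidyverse)

# Hypothetical stand-in for twitter_data so this sketch runs on its own.
# In the exercise the real data frame is preloaded; skip this step there.
# Column names other than complaint_label, and the label values, are assumptions.
twitter_data <- tibble(
  tweet_id = 1:5,
  tweet_text = c("text a", "text b", "text c", "text d", "text e"),
  complaint_label = c("Complaint", "Non-Complaint", "Complaint",
                      "Non-Complaint", "Complaint")
)

# Print the data to see its dimensions and columns.
twitter_data

# Keep only the rows labeled as complaints.
complaints <- twitter_data %>%
  filter(complaint_label == "Complaint")

# The number of rows is the number of complaints (3 for the toy data).
nrow(complaints)
```

Printing a tibble shows its dimensions in the header, so the filtered result also reveals the complaint count without calling nrow().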