
Airline tweets data

The twitter_data data frame has over 7,000 tweets about airlines. The tweets have already been classified as either complaints or non-complaints in the complaint_label column. Let's get a sense of how many of these tweets are complaints.

Be aware that this is real data from Twitter, so it may contain profanity or other offensive content, both in this exercise and in any later exercises that use real Twitter data.

This exercise is part of the course

Introduction to Text Analysis in R


Exercise instructions

  • Load the tidyverse package.
  • Get a sense of the size and content of the data by printing twitter_data.
  • Filter twitter_data so it's just the complaints. How many complaints are in the data?

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Load the tidyverse packages
___(___)

# Print twitter_data
___

# Print just the complaints in twitter_data
twitter_data %>% 
  ___(___ == ___)
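The filtering pattern used in this exercise can be sketched on a small made-up data frame. This is only an illustration, not the exercise solution: toy_tweets, its tweet_id and tweet_text columns, and the label values "Complaint" and "Non-Complaint" are assumptions invented for the example, so check the printed twitter_data for the actual column names and label values.

```r
# dplyr is one of the tidyverse packages; library(tidyverse) would load it too
library(dplyr)

# Hypothetical stand-in for twitter_data (all values invented for illustration)
toy_tweets <- tibble(
  tweet_id = 1:5,
  complaint_label = c("Complaint", "Non-Complaint", "Complaint",
                      "Non-Complaint", "Non-Complaint"),
  tweet_text = c("Lost my bag again", "Great crew today", "Two hour delay",
                 "Smooth landing", "Thanks for the upgrade")
)

# Keep only the rows whose label marks them as complaints
toy_complaints <- toy_tweets %>%
  filter(complaint_label == "Complaint")

# Print the filtered rows; the tibble header reports how many rows remain
toy_complaints

# Alternatively, tally every label at once instead of filtering
toy_tweets %>%
  count(complaint_label)
```

Printing a filtered tibble shows its dimensions in the header, which is one way to read off the number of complaints. count() gives the same figure alongside the non-complaint total.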