Airline tweets data
The twitter_data
data frame has over 7,000 tweets about airlines. The tweets have already been classified as either complaints or non-complaints in the complaint_label
column. Let's get a sense of how many of these tweets are complaints.
Be aware that this is real data from Twitter and as such there is always a risk that it may contain profanity or other offensive content (in this exercise, and any following exercises that also use real Twitter data).
This exercise is part of the course
Introduction to Text Analysis in R
Exercise instructions
- Load the
tidyverse
package. - Get a sense of the size and content of the data by printing
twitter_data
. - Filter
twitter_data
so it's just the complaints. How many complaints are in the data?
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the tidyverse packages
___(___)
# Print twitter_data
___
# Print just the complaints in twitter_data
twitter_data %>%
___(___ == ___)