Airline tweets data
The twitter_data data frame has over 7,000 tweets about airlines. The tweets have already been classified as either complaints or non-complaints in the complaint_label column. Let's get a sense of how many of these tweets are complaints.
Be aware that this is real data from Twitter and as such there is always a risk that it may contain profanity or other offensive content (in this exercise, and any following exercises that also use real Twitter data).
Diese Übung ist Teil des Kurses
Introduction to Text Analysis in R
Anleitung zur Übung
- Load the
tidyversepackage. - Get a sense of the size and content of the data by printing
twitter_data. - Filter
twitter_dataso it's just the complaints. How many complaints are in the data?
Interaktive Übung
Vervollständige den Beispielcode, um diese Übung erfolgreich abzuschließen.
# Load the tidyverse packages
___(___)
# Print twitter_data
___
# Print just the complaints in twitter_data
twitter_data %>%
___(___ == ___)