Tokenizing and counting
Explore the content of the airline tweets in twitter_data
through word counts. The content of each tweet is in the tweet_text
column.
This exercise is part of the course Introduction to Text Analysis in R.
Exercise instructions
- Load the tidyverse and tidytext packages.
- Tokenize the tweets in the tweet_text column.
- Compute word counts using the tokenized text.
- Arrange the counts in descending order.
Hands-on interactive exercise
Have a go at this exercise by completing the sample code below, replacing each ___ placeholder.
# Load the tidyverse and tidytext packages
___(___)
___(___)

tidy_twitter <- twitter_data %>%
  # Tokenize the twitter data
  ___(word, ___)

tidy_twitter %>%
  # Compute word counts
  ___(___) %>%
  # Arrange the counts in descending order
  ___(___)
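
One possible way to fill in the blanks is sketched below. It is not necessarily the course's reference solution. Because the course's twitter_data is not available here, the sketch builds a small hypothetical tibble with the same tweet_text column; the tweet_id column, the three example tweets, and the word_counts name are made up for illustration. The functions used are unnest_tokens() from tidytext and count(), arrange(), and desc() from dplyr.

```r
# Load the tidyverse and tidytext packages
library(tidyverse)
library(tidytext)

# Hypothetical stand-in for the course's twitter_data (assumption):
# only the tweet_text column is taken from the exercise description.
twitter_data <- tibble(
  tweet_id = 1:3,
  tweet_text = c(
    "My flight was delayed again",
    "Great flight, friendly crew",
    "Delayed flight and lost bag"
  )
)

tidy_twitter <- twitter_data %>%
  # Tokenize the twitter data: one row per word, stored in a new `word` column.
  # By default unnest_tokens() lowercases words and strips punctuation.
  unnest_tokens(word, tweet_text)

word_counts <- tidy_twitter %>%
  # Compute word counts; count() stores them in a column named `n`
  count(word) %>%
  # Arrange the counts in descending order
  arrange(desc(n))

print(word_counts)
```

With this toy data, "flight" ends up in the first row because it appears in all three tweets. As a shorter alternative, count(word, sort = TRUE) does the counting and the descending sort in one call.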