Tokenizing and counting

Explore the content of the airline tweets in twitter_data through word counts. The content of each tweet is in the tweet_text column.

This exercise is part of the course Introduction to Text Analysis in R.

Exercise instructions

  • Load the tidyverse and tidytext packages.
  • Tokenize the tweets in the tweet_text column.
  • Compute word counts using the tokenized text.
  • Arrange the counts in descending order.
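
The steps above can be sketched on a small made-up dataset before working with the real one. This is only an illustration: toy_tweets and its three tweets are hypothetical stand-ins for twitter_data, and only the tweet_text column name is taken from the exercise.

```r
# Load the packages named in the instructions
library(tidyverse)
library(tidytext)

# Hypothetical stand-in for twitter_data (not the course data)
toy_tweets <- tibble(
  tweet_id = 1:3,
  tweet_text = c(
    "Flight delayed again",
    "Great flight and great crew",
    "Delayed flight, lost bag"
  )
)

# Tokenize: one row per word; by default unnest_tokens()
# lowercases the text and strips punctuation
tidy_toy <- toy_tweets %>%
  unnest_tokens(word, tweet_text)

# Count how often each word occurs, then sort most frequent first
word_counts <- tidy_toy %>%
  count(word) %>%
  arrange(desc(n))

word_counts
```

In this toy data, "flight" appears in all three tweets and so ends up in the first row. Passing sort = TRUE to count() would give the same ordering without a separate arrange() step.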

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Load the tidyverse and tidytext packages
___(___)
___(___)

tidy_twitter <- twitter_data %>% 
  # Tokenize the twitter data
  ___(word, ___) 

tidy_twitter %>% 
  # Compute word counts
  ___(___) %>% 
  # Arrange the counts in descending order
  ___(___)