Fitting an LDA
It's time to run your first topic model! As discussed, the three additional arguments of the LDA()
function are critical for properly running a topic model. Note that running the LDA()
function could take about 10 seconds. The tidyverse
and tidytext
packages along with the tidy_twitter
dataset have been loaded for you.
This exercise is part of the course
Introduction to Text Analysis in R
Exercise instructions
- Load the
topicmodels
package. - Cast the word counts by tweet into a DTM.
- Run an LDA with 2 topics and a Gibbs sampler.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the topicmodels package
___
# Cast the word counts by tweet into a DTM
dtm_twitter <- ___ %>%
___(___) %>%
___(___)
# Run an LDA with 2 topics and a Gibbs sampler
lda_out <- LDA(
___,
k = ___,
method = ___,
control = list(seed = 42)
)