Assigning topics to documents

An LDA model is of little use unless you can interpret and use its results. You have been given the results of running an LDA model, sentence_lda, on a set of sentences, pig_sentences. To fully understand the results of any LDA analysis, you need to explore both the beta matrix (per-topic word probabilities, used to find the top words for each topic) and the gamma matrix (per-document topic probabilities, used to find the top topics for each document).

Given what you know about these two matrices, extract the results for a specific topic and see if the output matches expectations.
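To make the two matrices concrete, here is a minimal sketch on a toy corpus. The object names (toy_sentences, toy_dtm, toy_lda) and the sentences themselves are hypothetical stand-ins for pig_sentences and sentence_lda, and the choice of k = 2 with Gibbs sampling is an assumption made only for illustration.

```r
library(dplyr)
library(tidytext)
library(topicmodels)

# Hypothetical stand-in for pig_sentences: one row per sentence
toy_sentences <- tibble(
  sentence_id = 1:6,
  text = c(
    "pigs eat corn and apples",
    "pigs sleep in the barn",
    "the farmer feeds corn to pigs",
    "the barn has a red door",
    "apples fall from the tree",
    "the farmer paints the barn door"
  )
)

# One row per (sentence, word) pair, then cast to a document-term matrix
toy_dtm <- toy_sentences %>%
  unnest_tokens(word, text) %>%
  count(sentence_id, word) %>%
  cast_dtm(document = sentence_id, term = word, value = n)

# Fit a small model; k = 2 and method = "Gibbs" are arbitrary choices here
toy_lda <- LDA(toy_dtm, k = 2, method = "Gibbs", control = list(seed = 1234))

# beta: one row per (topic, term); gamma: one row per (document, topic)
toy_betas <- tidy(toy_lda, matrix = "beta")
toy_gammas <- tidy(toy_lda, matrix = "gamma")

# Within each topic, the betas sum to 1 across terms
beta_totals <- toy_betas %>%
  group_by(topic) %>%
  summarise(total = sum(beta))

# Within each document, the gammas sum to 1 across topics
gamma_totals <- toy_gammas %>%
  group_by(document) %>%
  summarise(total = sum(gamma))
```

Reading the totals this way is a quick sanity check: beta is a probability distribution over words for each topic, while gamma is a probability distribution over topics for each document.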

This exercise is part of the course

Introduction to Natural Language Processing in R

Instructions

Create a tibble for both the beta and gamma matrices.
Explore topic 5 by looking at its top words, arranging the results by decreasing beta values.
Explore topic 5 by seeing which sentences most align with topic 5 while arranging the results by decreasing gamma values.

Hands-on interactive exercise

Try this exercise by completing this sample code.

# Extract the beta and gamma matrices
sentence_betas <- tidy(sentence_lda, ___ = "___")
sentence_gammas <- tidy(sentence_lda, ___ = "___")

# Explore Topic 5 Betas
___ %>%
  ___(topic == ___) %>%
  arrange(-___)

# Explore Topic 5 Gammas
___ %>%
  ___(topic == ___) %>%
  arrange(-___)
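One possible way to carry out the steps in the scaffold is sketched below on a toy model, since sentence_lda itself is not available here. The sentences, the names demo_sentences, demo_dtm and demo_lda, and the settings k = 5 and method = "Gibbs" are all assumptions for illustration; with the real objects you would substitute sentence_lda for demo_lda.

```r
library(dplyr)
library(tidytext)
library(topicmodels)

# Hypothetical sentences standing in for pig_sentences
demo_text <- c(
  "pigs eat corn and apples",
  "pigs sleep in the barn",
  "the farmer feeds corn to pigs",
  "the barn has a red door",
  "apples fall from the tree",
  "the farmer paints the barn door",
  "rain fills the pond by the tree",
  "ducks swim in the pond"
)
demo_sentences <- tibble(sentence_id = seq_along(demo_text), text = demo_text)

# Tokenize, count words per sentence, and cast to a document-term matrix
demo_dtm <- demo_sentences %>%
  unnest_tokens(word, text) %>%
  count(sentence_id, word) %>%
  cast_dtm(document = sentence_id, term = word, value = n)

# k must be at least 5 so that topic 5 exists
demo_lda <- LDA(demo_dtm, k = 5, method = "Gibbs", control = list(seed = 1111))

# Extract the beta and gamma matrices as tibbles
demo_betas <- tidy(demo_lda, matrix = "beta")
demo_gammas <- tidy(demo_lda, matrix = "gamma")

# Top words for topic 5, highest beta first
topic5_betas <- demo_betas %>%
  filter(topic == 5) %>%
  arrange(-beta)

# Sentences most aligned with topic 5, highest gamma first
topic5_gammas <- demo_gammas %>%
  filter(topic == 5) %>%
  arrange(-gamma)
```

The document column in topic5_gammas holds the sentence identifiers, so the top rows can be matched back to the original sentences to judge whether the topic's top words fit them.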

Introduction to Natural Language Processing in R

Chapter 4 covers two staples of natural language processing: sentiment analysis and word embeddings. Both techniques are a must for anyone learning the fundamentals of text analysis. You will also briefly learn about BERT, part-of-speech tagging, and named entity recognition. The course covers almost 15 different analysis techniques, so chapter 4 ends by recapping everything you have learned.

Exercise 1: Sentiment analysis
Exercise 2: tidytext lexicons
Exercise 3: Sentiment scores
Exercise 4: Sentiment and emotion
Exercise 5: Word embeddings
Exercise 6: h2o practice
Exercise 7: word2vec
Exercise 8: Additional NLP analysis
Exercise 9: Reviewing methods #1
Exercise 10: Review methods #2
Exercise 11: Conclusion