TFIDF tibble vs dtm
TFIDF can be used for document similarity, text classification, and tasks. Consider the tibble, left_right_tfidf
, and the document-term matrix, left_right_matrix
. Both have been loaded into the console.
Which of the following statements is true?
- A: The tibble contains one row per document and a column for each word used in all of the text.
- B: The tibble contains the word counts, tf, idf, and tfidf weights for each word in each document.
- C: The tibble and the matrix have the same number of rows.
- D: The columns of the document-term matrix can be used in classification models.
This exercise is part of the course
Introduction to Natural Language Processing in R
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
