Exercise

TFIDF tibble vs dtm

TFIDF can be used for document similarity, text classification, and other tasks. Consider the tibble, left_right_tfidf, and the document-term matrix, left_right_matrix. Both have been loaded into the console.

Which of the following statements is true?

  • A: The tibble contains one row per document and a column for each word used in all of the text.
  • B: The tibble contains the word counts, tf, idf, and tfidf weights for each word in each document.
  • C: The tibble and the matrix have the same number of rows.
  • D: The columns of the document-term matrix can be used in classification models.
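
To compare the two shapes yourself, a minimal sketch along the following lines may help. It assumes the dplyr, tidytext, and tm packages are installed. The toy corpus and the names docs, toy_tfidf, and toy_matrix are hypothetical stand-ins, not the course objects, and the way left_right_tfidf and left_right_matrix were actually built is not shown in this exercise.

```r
library(dplyr)
library(tidytext)

# Hypothetical two-document corpus standing in for the course data
docs <- tibble(
  id = c("doc1", "doc2"),
  text = c("taxes should be lower", "taxes fund public schools")
)

# Tidy format: tokenize, count words per document, then add tf, idf, tf_idf
toy_tfidf <- docs %>%
  unnest_tokens(word, text) %>%
  count(id, word) %>%
  bind_tf_idf(word, id, n)

# Matrix format: cast the same counts to a document-term matrix
# (cast_dtm relies on the tm package being installed)
toy_matrix <- toy_tfidf %>%
  cast_dtm(id, word, n)

# Compare the dimensions and column names of the two objects
dim(toy_tfidf)
colnames(toy_tfidf)
dim(toy_matrix)
```

Running the same dim() and colnames() calls on left_right_tfidf and left_right_matrix in the console should let you check each statement against the real objects.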