Exercise

tf-idf vectors for TED talks

In this exercise, you are given a corpus, ted, which contains the transcripts of 500 TED Talks. Your task is to generate the tf-idf vectors for these talks.

In a later lesson, we will use these vectors to generate recommendations of similar talks based on the transcript.

Instructions

  • Import TfidfVectorizer from sklearn.feature_extraction.text.
  • Create a TfidfVectorizer object. Name it vectorizer.
  • Generate tfidf_matrix for ted using the fit_transform() method.
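
The three steps above could be sketched as follows. The real ted corpus is not reproduced here, so the snippet assumes a small, hypothetical list of three strings as a stand-in; only the names vectorizer and tfidf_matrix come from the instructions.

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical stand-in for the `ted` corpus; the exercise supplies
# 500 real transcripts under the same name.
ted = [
    "Good morning. How are you? It has been great.",
    "Thank you so much. I have been blown away by this conference.",
    "Ideas worth spreading are ideas worth sharing.",
]

# Create the vectorizer with its default settings.
vectorizer = TfidfVectorizer()

# Learn the vocabulary and idf weights, then transform the corpus
# into a sparse document-term matrix in a single call.
tfidf_matrix = vectorizer.fit_transform(ted)

# One row per document, one column per vocabulary term.
print(tfidf_matrix.shape)
```

With the actual corpus, the first dimension of tfidf_matrix.shape should be 500, one row per talk; the second depends on the vocabulary size of the transcripts.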