tf-idf vectors for TED talks
In this exercise, you have been given a corpus, ted, which contains the transcripts of 500 TED Talks. Your task is to generate the tf-idf vectors for these talks.
In a later lesson, we will use these vectors to generate recommendations of similar talks based on the transcript.
This exercise is part of the course Feature Engineering for NLP in Python.
Exercise instructions
- Import TfidfVectorizer from sklearn.
- Create a TfidfVectorizer object. Name it vectorizer.
- Generate tfidf_matrix for ted using the fit_transform() method.
Hands-on interactive exercise
Complete this sample code to finish the exercise.
# Import TfidfVectorizer
from ____ import ____
# Create TfidfVectorizer object
____
# Generate matrix of word vectors
tfidf_matrix = vectorizer.____(____)
# Print the shape of tfidf_matrix
print(tfidf_matrix.shape)
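
For readers without access to the course environment, the three steps can be tried on a small stand-in corpus. The sketch below is one possible way to carry them out, not the official solution: ted_sample and its three sentences are hypothetical placeholders for the real ted transcripts, and the import path assumes a standard scikit-learn installation.

```python
# Import TfidfVectorizer from scikit-learn's text feature extraction module
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical stand-in for the `ted` corpus (the real one holds 500 transcripts)
ted_sample = [
    "Ideas worth spreading change how we see the world.",
    "The world needs new ideas about education.",
    "Education changes how children learn and play.",
]

# Create the TfidfVectorizer object with its default settings
vectorizer = TfidfVectorizer()

# Learn the vocabulary and idf weights, then build the sparse tf-idf matrix
tfidf_matrix = vectorizer.fit_transform(ted_sample)

# Shape is (number of documents, number of distinct vocabulary terms)
print(tfidf_matrix.shape)
```

The first dimension of the shape is the number of documents, so with the full ted corpus it should be 500. The second dimension is the size of the vocabulary that the vectorizer learned from the transcripts.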