
Text vectorization

You'll now transform the desc column in the UFO dataset into TF-IDF vectors, since the free-text descriptions likely contain information we can learn from.

This exercise is part of the course

Preprocessing for Machine Learning in Python


Exercise instructions

  • Print out the .head() of the desc column.
  • Instantiate a TfidfVectorizer() object.
  • Fit and transform the desc column using vec.
  • Print out the .shape of desc_tfidf to see how many rows and columns the vectorization created.

Interactive exercise

Complete the sample code to finish this exercise successfully.

# Take a look at the head of the desc field
print(____)

# Instantiate the tfidf vectorizer object
vec = ____

# Fit and transform desc using vec
desc_tfidf = vec.____

# Look at the number of columns and rows
print(____.shape)
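
The same steps can be sketched on a tiny stand-in dataset. This is only an illustration, not the course's solution: the `ufo` DataFrame below is hypothetical, with three made-up descriptions in place of the real UFO data, and it assumes pandas and scikit-learn are installed.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical stand-in for the UFO dataset: a single desc column of made-up text
ufo = pd.DataFrame({
    "desc": [
        "bright light hovering over the lake",
        "triangle shaped object with three lights",
        "fast moving light in the night sky",
    ]
})

# Take a look at the head of the desc field
print(ufo["desc"].head())

# Instantiate the tfidf vectorizer object with default settings
vec = TfidfVectorizer()

# Learn the vocabulary and build the TF-IDF matrix in one step
desc_tfidf = vec.fit_transform(ufo["desc"])

# Rows correspond to documents, columns to distinct vocabulary terms
print(desc_tfidf.shape)
```

The result is a sparse matrix with one row per description and one column per term in the fitted vocabulary, so on a real text column the column count can grow large quickly.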