CommencerCommencer gratuitement

Text vectorization

You'll now transform the desc column in the UFO dataset into tf/idf vectors, since there's likely something we can learn from this field.

Cet exercice fait partie du cours

Preprocessing for Machine Learning in Python

Afficher le cours

Instructions

  • Print out the .head() of the desc column.
  • Instantiate a TfidfVectorizer() object.
  • Fit and transform the desc column using vec.
  • Print out the .shape of the desc_tfidf vector, to take a look at the number of columns this created.

Exercice interactif pratique

Essayez cet exercice en complétant cet exemple de code.

# Take a look at the head of the desc field
print(____)

# Instantiate the tfidf vectorizer object
vec = ____

# Fit and transform desc using vec
desc_tfidf = vec.____

# Look at the number of columns and rows
print(____.shape)
Modifier et exécuter le code