Get startedGet started for free

Text vectorization

You'll now transform the desc column in the UFO dataset into tf/idf vectors, since there's likely something we can learn from this field.

This exercise is part of the course

Preprocessing for Machine Learning in Python

View Course

Exercise instructions

  • Print out the .head() of the desc column.
  • Instantiate a TfidfVectorizer() object.
  • Fit and transform the desc column using vec.
  • Print out the .shape of the desc_tfidf vector, to take a look at the number of columns this created.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Take a look at the head of the desc field
print(____)

# Instantiate the tfidf vectorizer object
vec = ____

# Fit and transform desc using vec
desc_tfidf = vec.____

# Look at the number of columns and rows
print(____.shape)
Edit and Run Code