1. Learn
  2. /
  3. Courses
  4. /
  5. Preprocessing for Machine Learning in Python

Exercise

Text vectorization

You'll now transform the desc column in the UFO dataset into tf/idf vectors, since there's likely something we can learn from this field.

Instructions

100 XP
  • Print out the .head() of the desc column.
  • Instantiate a TfidfVectorizer() object.
  • Fit and transform the desc column using vec.
  • Print out the .shape of the desc_tfidf vector, to take a look at the number of columns this created.