Vectorizing text
You'll now transform the volunteer dataset's title column into a text vector, which you'll use in a prediction task in the next exercise.
Deze oefening maakt deel uit van de cursus
Preprocessing for Machine Learning in Python
Oefeninstructies
- Store the
volunteer["title"]column in a variable namedtitle_text. - Instantiate a
TfidfVectorizerastfidf_vec. - Transform the text in
title_textinto a tf-idf vector usingtfidf_vec.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Take the title text
title_text = ____
# Create the vectorizer method
tfidf_vec = ____
# Transform the text into tf-idf vectors
text_tfidf = tfidf_vec.____(____)