Transforming text to numbers with BoW
Now that you've built a vocabulary from the customer reviews, you're ready to transform each review into a numerical format using the Bag-of-Words (BoW) model. This step creates a structured matrix where each row represents a review and each column corresponds to a word from the vocabulary.
The cleaned_reviews list and the fitted vectorizer are pre-loaded for you.
Deze oefening maakt deel uit van de cursus
Natural Language Processing (NLP) in Python
Oefeninstructies
- Transform the
cleaned_reviewsinto abow_matrix. - Print the BoW representation as a NumPy array.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Transform the reviews
bow_matrix = vectorizer.____(____)
# Print the BoW representation
print(____.____())