Transforming text to numbers with BoW
Now that you've built a vocabulary from the customer reviews, you're ready to transform each review into a numerical format using the Bag-of-Words (BoW) model. This step creates a structured matrix where each row represents a review and each column corresponds to a word from the vocabulary.
The cleaned_reviews
list and the fitted vectorizer
are pre-loaded for you.
This exercise is part of the course
Natural Language Processing (NLP) in Python
Exercise instructions
- Transform the
cleaned_reviews
into abow_matrix
. - Print the BoW representation as a NumPy array.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Transform the reviews
bow_matrix = vectorizer.____(____)
# Print the BoW representation
print(____.____())