Get startedGet started for free

Transforming text to numbers with BoW

Now that you've built a vocabulary from the customer reviews, you're ready to transform each review into a numerical format using the Bag-of-Words (BoW) model. This step creates a structured matrix where each row represents a review and each column corresponds to a word from the vocabulary.

The cleaned_reviews list and the fitted vectorizer are pre-loaded for you.

This exercise is part of the course

Natural Language Processing (NLP) in Python

View Course

Exercise instructions

  • Transform the cleaned_reviews into a bow_matrix.
  • Print the BoW representation as a NumPy array.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Transform the reviews
bow_matrix = vectorizer.____(____)

# Print the BoW representation
print(____.____())
Edit and Run Code