1. Learn
  2. /
  3. Courses
  4. /
  5. Natural Language Processing with spaCy

Exercise

Tokenization with spaCy

In this exercise, you'll practice tokenizing text. You'll use the first review from the Amazon Fine Food Reviews dataset for this exercise. You can access this review by using the text object provided.

The en_core_web_sm model is already loaded for you. You can access it by calling nlp(). You can use list comprehension to compile output lists.

Instructions

100 XP
  • Store Doc container for the pre-loaded review in a document object.
  • Store and review texts of all the tokens of the document in the variable first_text_tokens.