1. Learn
  2. /
  3. Courses
  4. /
  5. Natural Language Processing (NLP) in Python

Connected

Exercise

Sentence and word tokenization

Tokenization is an important first step in NLP. It involves breaking text into smaller units called tokens, which is key to working with language data. Your task is to tokenize a snippet of a news article into both sentences and words.

Instructions 1/2

undefined XP
    1
    2
  • Import the nltk library.
  • Download the punkt_tab package.
  • Tokenize the text into sentences.