Exercise

Remove stop words and additional spaces

A text corpus usually contains many common words such as a, an, the, of, and but. These are called stop words.

Stop words are usually removed during text processing so that the analysis can focus on the important words in the corpus.

Removing special characters, punctuation, numbers, and stop words leaves extra spaces behind, and these also need to be removed from the corpus.

The corpus that you created in the last exercise has been pre-loaded as twt_corpus_lwr.

The library tm has been pre-loaded for this exercise.
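
The two cleaning steps described above can be sketched as follows. This is a minimal illustration rather than the exercise solution: it builds a small hypothetical corpus (toy_corpus, made from invented sample sentences) instead of using the pre-loaded twt_corpus_lwr, and the object names are assumptions chosen for the example. It relies on tm's tm_map(), removeWords(), stopwords(), and stripWhitespace() functions.

```r
library(tm)

# Hypothetical sample text standing in for the real tweets (already lowercase)
sample_text <- c("the cat sat on the mat", "an apple and a pear")
toy_corpus <- Corpus(VectorSource(sample_text))

# Step 1: remove English stop words using tm's built-in word list
toy_corpus_stpwd <- tm_map(toy_corpus, removeWords, stopwords("english"))

# Step 2: collapse the runs of spaces left behind by the removed words
toy_corpus_final <- tm_map(toy_corpus_stpwd, stripWhitespace)

# Inspect the cleaned text
cleaned <- content(toy_corpus_final)
print(cleaned)
```

Note that stripWhitespace() collapses repeated whitespace into a single space; it does not necessarily trim a space left at the very start or end of a document.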

Instructions 1/2
  • Remove English stop words from the corpus twt_corpus_lwr.