Get startedGet started for free

Stems from tweets

In this exercise, you will work with an array called tweets. It contains the text of the airline sentiment data collected from Twitter.

Your task is to work with this array and transform it into a list of tokens using list comprehension. After that, iterate over the list of tokens and create a stem out of each token. Remember that list comprehensions are a one-line alternative to for loops.

This exercise is part of the course

Sentiment Analysis in Python

View Course

Exercise instructions

  • Import the function we used to transform strings into stems.
  • Call the Porter stemmer function you just imported.
  • Using a list comprehension, create the list tokens. It should contain all the word tokens from the tweets array.
  • Iterate over the tokens list and apply the stemming function to each item in the list.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Import the function to perform stemming
____
from nltk import word_tokenize

# Call the stemmer
porter = ____()

# Transform the array of tweets to tokens
tokens = [____]
# Stem the list of tokens
stemmed_tokens = [[____.____(word) for word in tweet] for tweet in tokens] 
# Print the first element of the list
print(stemmed_tokens[0])
Edit and Run Code