Collecting data on keywords
Now that we've set up the authentication, we can begin to collect Twitter data. Recall that the Streaming API lets us collect real-time Twitter data either as a sample or filtered by keyword.
In our example, we will collect data on any tweet mentioning #rstats or #python in the tweet text, username, or user description with the filter endpoint.
The SListener module has already been defined and imported for you.
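To make the idea of keyword tracking concrete, here is a minimal offline sketch. It assumes simple case-insensitive substring matching over the three fields named above; the real matching happens on Twitter's servers and follows more detailed rules. The function name matches_keywords and the sample tweets are made up for illustration.

```python
# Illustrative only: this is NOT how Twitter matches tweets internally.
# It assumes a plain case-insensitive substring check.

def matches_keywords(tweet, keywords):
    """Return True if any keyword appears in the text, username, or description."""
    user = tweet.get("user", {})
    fields = [
        tweet.get("text", ""),
        user.get("screen_name", ""),
        user.get("description", ""),
    ]
    haystack = " ".join(fields).lower()
    return any(keyword.lower() in haystack for keyword in keywords)


# Made-up sample tweets, shaped loosely like Twitter's JSON.
sample_tweets = [
    {"text": "Loving #rstats today", "user": {"screen_name": "ann", "description": "analyst"}},
    {"text": "Good morning", "user": {"screen_name": "bob", "description": "I write #Python"}},
    {"text": "Lunch time", "user": {"screen_name": "cat", "description": "foodie"}},
]

keywords = ["#rstats", "#python"]
matched = [t["user"]["screen_name"] for t in sample_tweets if matches_keywords(t, keywords)]
print(matched)  # ['ann', 'bob']
```

The second tweet matches through the user description rather than the tweet text, which is why a keyword filter can return tweets that never mention the hashtag themselves.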
This exercise is part of the course Analyzing Social Media Data in Python.
Exercise instructions
- Import Stream from tweepy.
- Set keywords_to_track to a list containing #rstats and #python.
- Pass the auth and listen objects to Stream.
- Set the keyword argument track equal to keywords_to_track.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
from ____ import ____
# Set up words to track
keywords_to_track = ____
# Instantiate the SListener object
listen = SListener(api)
# Instantiate the Stream object
stream = Stream(____, ____, access_token, access_token_secret)
# Begin collecting data
stream.filter(track = ____)
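The listener-and-stream pattern in the sample code can be pictured with a small offline stand-in. This is only a sketch: DemoListener and DemoStream are hypothetical classes, not tweepy's Stream or the course's SListener (whose implementation is not shown here), and the canned tweets are invented. It shows the general flow, where the stream pushes each matching tweet to a listener callback.

```python
import json


class DemoListener:
    """Hypothetical stand-in for a stream listener: stores each tweet it is handed."""

    def __init__(self):
        self.collected = []

    def on_status(self, tweet):
        # Keep each tweet as a JSON string, one record per tweet.
        self.collected.append(json.dumps(tweet))


class DemoStream:
    """Hypothetical offline stream: replays canned tweets instead of calling Twitter."""

    def __init__(self, listener, source):
        self.listener = listener
        self.source = source

    def filter(self, track):
        # Hand every tweet whose text contains a tracked keyword to the listener.
        for tweet in self.source:
            text = tweet["text"].lower()
            if any(word.lower() in text for word in track):
                self.listener.on_status(tweet)


# Invented tweets used in place of a live connection.
canned = [
    {"id": 1, "text": "New #rstats package released"},
    {"id": 2, "text": "Coffee first"},
    {"id": 3, "text": "Plotting in #Python"},
]

listen = DemoListener()
stream = DemoStream(listen, canned)
stream.filter(track=["#rstats", "#python"])
print(len(listen.collected))  # 2
```

Separating the listener from the stream keeps the storage logic (what to do with each tweet) independent of the connection logic (which tweets arrive).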