Session Ready
Exercise

Loading tweets into a DataFrame

Now it's time to import data into a pandas DataFrame so we can analyze tweets at scale.

We will work with a dataset of tweets which contain the hashtag '#rstats' or '#python'. This dataset is stored as a list of tweet JSON objects in data_science_json.

This course touches on a lot of concepts you may have forgotten, so if you ever need a quick refresher, download the pandas basics Cheat Sheet and keep it handy!

Be aware that this is real data from Twitter and as such there is always a risk for the presence of profanity or other offensive content (in this exercise, and any following exercises that also use real Twitter data).

Instructions
100 XP
  • Import pandas (remember, by convention we'll alias it as pd).
  • Flatten the data_science_json tweets with flatten_tweets() and store them in tweets.
  • Create a DataFrame from tweets using pd.DataFrame().
  • Print out the text from the first 5 tweets.