Get startedGet started for free

Creating time series data frame

Time series data is used when we want to analyze or explore variation over time. This is useful when exploring Twitter text data if we want to track the prevalence of a word or set of words.

The first step in doing this is converting the DataFrame into a format which can be handled using pandas time series methods. That can be done by converting the index to a datetime type.

This exercise is part of the course

Analyzing Social Media Data in Python

View Course

Exercise instructions

  • Print the first five rows of created_at in ds_tweets with the .head() method.
  • Convert that column to a datetime type with the Pandas' .to_datetime() method.
  • Print the first five rows once again.
  • Set index to created_at with .set_index().

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Print created_at to see the original format of datetime in Twitter data
print(ds_tweets[____].____())

# Convert the created_at column to np.datetime object
ds_tweets[____] = pd.____(____)

# Print created_at to see new format
print(ds_tweets[____].____())

# Set the index of ds_tweets to created_at
ds_tweets = ds_tweets.____(____)
Edit and Run Code