Get startedGet started for free

Comparing #python to #rstats

Now that we have a function to check whether or not the word is in the tweet in multiple places, we can deploy this across multiple words and compare them. Let's return to our example with the data science hashtag dataset. We want to see how many times that #rstats occurs compared to #python.

The data science hashtag dataset ds_tweets has been loaded for you.

This exercise is part of the course

Analyzing Social Media Data in Python

View Course

Exercise instructions

  • Use the function check_word_in_tweet() to find all instances of #python in the text fields of ds_tweets.
  • Do the same with #rstats.
  • Print proportion of tweets mentioning #python by summing python with np.sum() and dividing it by ds_tweets.shape[0].
  • Do the same for rstats.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Find mentions of #python in all text fields
python = ____(____, ds_tweets)

# Find mentions of #rstats in all text fields
rstats = ____(____, ____)

# Print proportion of tweets mentioning #python
print("Proportion of #python tweets:", ____(____) / ____)

# Print proportion of tweets mentioning #rstats
print("Proportion of #rstats tweets:", ____(____) / ____)
Edit and Run Code