
Comparing #python to #rstats

Now that we have a function that checks whether a word appears in any of a tweet's text fields, we can apply it to several words and compare the results. Let's return to the data science hashtag dataset and see how often #rstats occurs compared to #python.

The data science hashtag dataset ds_tweets has been loaded for you.

This exercise is part of the course

Analyzing Social Media Data in Python


Exercise instructions

  • Use the function check_word_in_tweet() to find all instances of #python in the text fields of ds_tweets.
  • Do the same with #rstats.
  • Print the proportion of tweets mentioning #python by summing python with np.sum() and dividing the result by ds_tweets.shape[0].
  • Do the same for rstats.
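The last two steps rely on the fact that summing a boolean mask counts its True values. The snippet below is a minimal sketch of that arithmetic on made-up data; `mentions` and `toy_tweets` are hypothetical stand-ins for a result of check_word_in_tweet() and for ds_tweets, not objects from the course.

```python
import numpy as np
import pandas as pd

# Hypothetical boolean mask: True where a tweet mentions the word
mentions = pd.Series([True, False, True, False, False])

# Hypothetical stand-in for ds_tweets, with one row per tweet
toy_tweets = pd.DataFrame({"text": ["a", "b", "c", "d", "e"]})

# True counts as 1, so the sum is the number of matching tweets
count = np.sum(mentions)

# shape[0] is the number of rows, i.e. the total number of tweets
proportion = count / toy_tweets.shape[0]

print("Matching tweets:", count)
print("Proportion:", proportion)
```

Dividing by shape[0] rather than by the count of non-missing text values keeps the denominator equal to the full number of tweets in the dataset.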

Hands-on interactive exercise

Try this exercise by completing this sample code.

# Find mentions of #python in all text fields
python = ____(____, ds_tweets)

# Find mentions of #rstats in all text fields
rstats = ____(____, ____)

# Print proportion of tweets mentioning #python
print("Proportion of #python tweets:", ____(____) / ____)

# Print proportion of tweets mentioning #rstats
print("Proportion of #rstats tweets:", ____(____) / ____)
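For readers without access to the course environment, the sketch below shows one way the whole comparison could run end to end. It makes several assumptions: the check_word_in_tweet() defined here is a simplified stand-in and may differ from the course's version, the column name "quoted_status-text" is illustrative, and `toy_tweets` is invented data used in place of ds_tweets.

```python
import numpy as np
import pandas as pd


# Simplified stand-in for check_word_in_tweet(); the course's version may differ.
# The default column names are assumptions made for this illustration.
def check_word_in_tweet(word, data, text_columns=("text", "quoted_status-text")):
    """Return a boolean Series: True where any text column contains `word`."""
    contains = pd.Series(False, index=data.index)
    for column in text_columns:
        if column in data.columns:
            # Literal, case-insensitive match; missing values count as no match
            contains |= data[column].str.contains(
                word, case=False, na=False, regex=False
            )
    return contains


# Invented data standing in for ds_tweets
toy_tweets = pd.DataFrame(
    {
        "text": [
            "Learning #python today",
            "Plotting with #rstats",
            "RT: great thread",
            "#Python and #rstats together",
        ],
        "quoted_status-text": [None, None, "I love #python", None],
    }
)

python = check_word_in_tweet("#python", toy_tweets)
rstats = check_word_in_tweet("#rstats", toy_tweets)

print("Proportion of #python tweets:", np.sum(python) / toy_tweets.shape[0])
print("Proportion of #rstats tweets:", np.sum(rstats) / toy_tweets.shape[0])
```

The third toy tweet matches #python only through its quoted text, which is why checking more than one text field matters when counting mentions.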