Comparing #python to #rstats
Now that we have a function that checks whether a word appears in any of a tweet's text fields, we can apply it to several words and compare the results. Let's return to the data science hashtag dataset and see how often #rstats occurs compared to #python.
The data science hashtag dataset, ds_tweets, has been loaded for you.
This exercise is part of the course Analyzing Social Media Data in Python.
Exercise instructions
- Use the function check_word_in_tweet() to find all instances of #python in the text fields of ds_tweets.
- Do the same with #rstats.
- Print the proportion of tweets mentioning #python by summing python with np.sum() and dividing it by ds_tweets.shape[0].
- Do the same for rstats.
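The last two instructions rely on the fact that summing a boolean Series counts its True values, so dividing by the number of rows gives a proportion. A minimal illustration on a made-up mask (the values below are invented for demonstration and are not from ds_tweets):

```python
import numpy as np
import pandas as pd

# Hypothetical boolean mask: True where a tweet mentions the word
mask = pd.Series([True, False, True, True, False])

# np.sum() treats True as 1 and False as 0, so this counts the matches
count = np.sum(mask)

# shape[0] is the number of rows, so the ratio is the share of matching tweets
proportion = count / mask.shape[0]

print("Matches:", count)
print("Proportion:", proportion)
```

The same proportion can also be obtained with mask.mean(), since the mean of a boolean Series is the fraction of True values.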
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Find mentions of #python in all text fields
python = ____(____, ds_tweets)
# Find mentions of #rstats in all text fields
rstats = ____(____, ____)
# Print proportion of tweets mentioning #python
print("Proportion of #python tweets:", ____(____) / ____)
# Print proportion of tweets mentioning #rstats
print("Proportion of #rstats tweets:", ____(____) / ____)
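One possible way to fill in the blanks is sketched below. Because neither check_word_in_tweet() nor ds_tweets is shown here, the sketch supplies a stand-in for each: the function body, the list of text columns, and the four sample tweets are all assumptions made so the example runs on its own, and the course's actual helper and dataset may differ.

```python
import numpy as np
import pandas as pd

# Assumed text columns; the real dataset may have more or different fields
TEXT_COLUMNS = ["text", "quoted_status-text"]

def check_word_in_tweet(word, data):
    """Stand-in helper: True for each row where `word` appears in any text column."""
    mask = pd.Series(False, index=data.index)
    for col in TEXT_COLUMNS:
        # Literal, case-insensitive match; missing values count as no match
        mask |= data[col].str.contains(word, case=False, regex=False, na=False)
    return mask

# Toy stand-in for ds_tweets (invented rows, for illustration only)
ds_tweets = pd.DataFrame({
    "text": [
        "Learning #python today",
        "I love #rstats",
        "Nothing to see here",
        "#Python and #rstats together",
    ],
    "quoted_status-text": [None, "great #python tip", None, None],
})

# Find mentions of #python in all text fields
python = check_word_in_tweet("#python", ds_tweets)

# Find mentions of #rstats in all text fields
rstats = check_word_in_tweet("#rstats", ds_tweets)

# Proportion of tweets mentioning each hashtag
python_share = np.sum(python) / ds_tweets.shape[0]
rstats_share = np.sum(rstats) / ds_tweets.shape[0]

print("Proportion of #python tweets:", python_share)
print("Proportion of #rstats tweets:", rstats_share)
```

On this toy data, three of the four rows mention #python (one only in the quoted text) and two mention #rstats, so the printed proportions are 0.75 and 0.5.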