Character count of Russian tweets
In this exercise, you have been given a dataframe tweets
which contains some tweets associated with Russia's Internet Research Agency and compiled by FiveThirtyEight.
Your task is to create a new feature 'char_count' in tweets
which computes the number of characters for each tweet. Also, compute the average length of each tweet. The tweets are available in the content
feature of tweets
.
Be aware that this is real data from Twitter and as such there is always a risk that it may contain profanity or other offensive content (in this exercise, and any following exercises that also use real Twitter data).
This exercise is part of the course
Feature Engineering for NLP in Python
Exercise instructions
- Create a new feature
char_count
by applyinglen
to the 'content' feature oftweets
. - Print the average character count of the tweets by computing the mean of the 'char_count' feature.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Create a feature char_count
tweets['char_count'] = tweets[____].apply(____)
# Print the average character count
print(tweets[____].____)