Character count of Russian tweets
In this exercise, you have been given a dataframe tweets
which contains some tweets associated with Russia's Internet Research Agency and compiled by FiveThirtyEight.
Your task is to create a new feature 'char_count' in tweets
which computes the number of characters for each tweet. Also, compute the average length of each tweet. The tweets are available in the content
feature of tweets
.
Be aware that this is real data from Twitter and as such there is always a risk that it may contain profanity or other offensive content (in this exercise, and any following exercises that also use real Twitter data).
Cet exercice fait partie du cours
Feature Engineering for NLP in Python
Instructions
- Create a new feature
char_count
by applyinglen
to the 'content' feature oftweets
. - Print the average character count of the tweets by computing the mean of the 'char_count' feature.
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# Create a feature char_count
tweets['char_count'] = tweets[____].apply(____)
# Print the average character count
print(tweets[____].____)