Try another name
You are still working on your Twitter sentiment analysis. You analyze now some things that caught your attention. You noticed that there are email addresses inserted in some tweets. Now, you are curious to find out which is the most common name.
You want to extract the first part of the email. E.g. if you have the email [email protected], you are only interested in marysmith90.
You need to match the entire expression. So you make sure to extract only names present in emails. Also, you are only interested in names containing upper (e.g. A,B, Z) or lowercase letters (e.g. a, d, z) and numbers.
The list sentiment_analysis containing the text of three tweets as well as the re module were loaded in your session. You can use print() to view it in the IPython Shell.
Diese Übung ist Teil des Kurses
Regular Expressions in Python
Anleitung zur Übung
- Complete the regex to match the email capturing only the name part. The name part appears before the
@. - Find all matches of the regex in each element of
sentiment_analysisanalysis. Assign it to the variableemail_matched. - Complete the
.format()method to print the results captured in each element ofsentiment_analysisanalysis.
Interaktive Übung
Vervollständige den Beispielcode, um diese Übung erfolgreich abzuschließen.
# Write a regex that matches email
regex_email = r"___[____]____\S+"
for tweet in sentiment_analysis:
# Find all matches of regex in each tweet
email_matched = re.____(____, ____)
# Complete the format method to print the results
print("Lists of users found in this tweet: {}".format(____))