Removing titles and taking names
While collecting survey respondent metadata in the airlines DataFrame, the full name of respondents was saved in the full_name column. However upon closer inspection, you found that a lot of the different names are prefixed by honorifics such as "Dr.", "Mr.", "Ms." and "Miss".
Your ultimate objective is to create two new columns named first_name and last_name, containing the first and last names of respondents respectively. Before doing so however, you need to remove honorifics.
The airlines DataFrame is in your environment, alongside pandas as pd.
This exercise is part of the course
Cleaning Data in Python
Exercise instructions
- Remove
"Dr.","Mr.","Miss"and"Ms."fromfull_nameby replacing them with an empty string""in that order. - Run the
assertstatement using.str.contains()that tests whetherfull_namestill contains any of the honorifics.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Replace "Dr." with empty string ""
airlines['full_name'] = airlines['full_name'].____.____("____","")
# Replace "Mr." with empty string ""
airlines['full_name'] = ____
# Replace "Miss" with empty string ""
____
# Replace "Ms." with empty string ""
____
# Assert that full_name has no honorifics
assert airlines['full_name'].str.contains('Ms.|Mr.|Miss|Dr.').any() == False