Cleaning up strings
In this lesson, we learned some basics of "regex," or regular expressions, which allow us to capture general patterns. We've covered two notations:
| Expression | Does this |
|---|---|
. |
matches any character |
* |
zero or more times |
For example, ".*science " would match "data science " in the string "data science rocks!"
Let's use what we've learned to change the response_var in the dataset you created in the previous lesson, gathered_data.
Cet exercice fait partie du cours
Categorical Data in the Tidyverse
Instructions
- Use
str_removeto remove everything before and including "rude to " (with the space at the end) in theresponse_varcolumn. - Use
str_removeto remove "on a plane" from theresponse_varcolumn.
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
gathered_data %>%
# Remove everything before and including "rude to " (with that space at the end!)
mutate(response_var = ___(response_var, ___)) %>%
# Remove "on a plane"
mutate(response_var = ___(response_var, ___))