Validating age
Now that you've found some inconsistencies in the total amounts, you're suspicious that there may also be inconsistencies in the acct_age column, and you want to see if these inconsistencies are related. Using the skills you learned from the video exercise, you'll need to validate the age of each account and see if the rows with inconsistent acct_age values are the same ones that had inconsistent total values.
dplyr and lubridate are loaded, and accounts is available.
This exercise is part of the course Cleaning Data in R.
Exercise instructions
- Create a new column called theoretical_age that contains the age of each account based on the date_opened.
- Find the accounts where the acct_age doesn't match the theoretical_age.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Find invalid acct_age
accounts %>%
  # theoretical_age: age of acct based on date_opened
  mutate(theoretical_age = ___) %>%
  # Filter for rows where acct_age is different from theoretical_age
  ___
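To see the idea in action without relying on the course data, here is a minimal sketch on a small made-up data frame. It is one possible approach, not necessarily the course's intended solution. The accounts_demo data, the id column, and the fixed ref_date are assumptions introduced for illustration; the exercise itself would presumably measure age up to the current date. The age is computed as the whole number of years between date_opened and the reference date, using lubridate's interval() and time_length().

```r
library(dplyr)
library(lubridate)

# Hypothetical toy data standing in for the course's accounts data frame
accounts_demo <- data.frame(
  id = c("A1", "A2", "A3"),
  date_opened = as.Date(c("2010-06-15", "2015-03-10", "2018-03-20")),
  acct_age = c(9, 4, 3)
)

# Fixed reference date (an assumption) so the result is reproducible
ref_date <- as.Date("2020-01-01")

mismatched <- accounts_demo %>%
  # theoretical_age: completed years between date_opened and ref_date
  mutate(
    theoretical_age = floor(time_length(interval(date_opened, ref_date), "years"))
  ) %>%
  # Keep only rows where the recorded age disagrees with the computed one
  filter(acct_age != theoretical_age)

print(mismatched)
```

In this toy data only account A3 is flagged: it was opened less than two years before the reference date, yet its recorded acct_age is 3. Comparing the flagged rows against the rows with inconsistent totals would then show whether the two problems overlap.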