Recoding the topics
There's one more step of data cleaning to make this more interpretable. Right now, topics are represented by two-letter codes:
- me: Palestinian conflict
- nu: Nuclear weapons and nuclear material
- di: Arms control and disarmament
- hr: Human rights
- co: Colonialism
- ec: Economic development
So that you can interpret the data more easily, recode the data to replace these codes with their full name. You can do that with dplyr
's recode()
function, which replaces values with ones you specify:
example <- c("apple", "banana", "apple", "orange")
recode(example,
apple = "plum",
banana = "grape")
This exercise is part of the course
Case Study: Exploratory Data Analysis in R
Exercise instructions
Use the recode()
function from dplyr
in a mutate()
to replace each two-letter code in the votes_gathered
data frame with the corresponding full name. Save this as votes_tidied
.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Replace the two-letter codes in topic: votes_tidied
votes_tidied <- ___ %>%
mutate(topic = recode(___,
___ = "Palestinian conflict",
___ = "Nuclear weapons and nuclear material",
___ = "Arms control and disarmament",
___ = "Human rights",
___ = "Colonialism",
___ = "Economic development"))