Importing and joining the accident data
It's the final chapter! Congratulations on making it this far. This chapter's scenario is that a senior executive believes that workplace accidents have increased this past year at the production sites. She wants you to find out if that's true, and if it is, to look into what might be driving the increase.
Start by importing the HR and accident datasets. Then join them together, and add a had_accident
variable to make it easier to analyze accident rates.
This exercise is part of the course
HR Analytics: Exploring Employee Data in R
Exercise instructions
- Import
"hr_data_2.csv"
and"accident_data.csv"
withread_csv()
. Assign them tohr_data
andaccident_data
, respectively. - Use
left_join()
to add the accident data to the HR data. Join on both employee ID and year. - Using
%>%
after the join, usemutate()
to addhad_accident
, which is 0 whenaccident_type
isNA
, and 1 otherwise.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the packages
library(readr)
library(dplyr)
# Import the data
hr_data <- ___
accident_data <- ___
# Create hr_joined with left_join() and mutate()
hr_joined <- ___ %>%
mutate(___)
hr_joined