Building a reduced model
Variable importance analysis helped you identify the most predictive features from the attrition
dataset. Based on it, you will build a drastically reduced model with only three variables: OverTime
, DistanceFromHome
, and NumCompaniesWorked
and compare its performance to the full model baseline. The metrics you estimated for the full model are stored in aug_full
.
All data, along with the train
and test splits, is available in your environment.
Cet exercice fait partie du cours
Feature Engineering in R
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# Create a recipe using the formula syntax that includes only OverTime, DistanceFromHome and NumCompaniesWorked as predictors
recipe_reduced <-
___(Attrition ~ ___ + ___ + ___, data = train)
# Bundle the recipe with your model
workflow_reduced <-
workflow() %>%
add_model(model) %>%
___