Comparing models
Choosing the best of multiple competing models can be tricky if these models are built on incomplete data. In this exercise, you will extend the model you have built previously by adding one more explanatory variable: the race of the movie's subject. Then, you will try to compare it to the previous model.
As a reminder, this is how you fitted the first model:
model_1 <- lm(earnings ~ country + year + sub_type,
              data = biopics)
Let's see if we can judge whether adding the race variable improves the model!
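One reason the comparison can be tricky: by default, lm() silently drops every row that has a missing value in any of the model's variables, so two models with different predictors may end up fitted to different subsets of rows. The following is a minimal sketch on made-up data; the data frame `toy` and its columns `y`, `x1`, and `x2` are hypothetical and are not part of the course's biopics data.

```r
# Toy data for illustration only; all names here are hypothetical.
set.seed(1)
toy <- data.frame(
  y  = rnorm(20),
  x1 = rnorm(20),
  x2 = rnorm(20)
)

# Make the extra predictor incomplete in three rows.
toy$x2[c(3, 8, 15)] <- NA

# The smaller model uses only complete columns;
# the larger one adds the incomplete predictor.
fit_small <- lm(y ~ x1, data = toy)
fit_large <- lm(y ~ x1 + x2, data = toy)

# With the default na.action (na.omit), rows with an NA in any
# model variable are dropped, so the two fits use different rows.
nobs(fit_small)  # 20
nobs(fit_large)  # 17
```

When the two counts differ like this, fit statistics such as R-squared describe different samples, so comparing them directly can be misleading.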
This exercise is part of the course
Handling Missing Data with Imputations in R
Have a go at this exercise by completing this sample code.
# Fit linear regression to predict earnings
model_2 <- ___(___,
               ___ = ___)