Running the bootstrap
Good job writing calc_gender_coef()
in the last exercise! This function creates a bootstrap sample, imputes it and, outputs the linear regression coefficient describing the impact of movie subject's being a female on the movie's earnings.
In this exercise, you will use the boot
package in order to obtain a bootstrapped distribution of such coefficients. The spread of this distribution will capture the uncertainty from imputation. You will also look at how the bootstrapped distribution differs from a single-time imputation and regression. Let's do some bootstrapping!
This exercise is part of the course
Handling Missing Data with Imputations in R
Exercise instructions
- Load the
boot
package. - Run bootstrapping on
biopics
, passingcalc_gender_coef
asstatistic
and setR
to 50; assign the result toboot_results
. print()
andplot()
the bootstrapping results.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the boot library
___
# Run bootstrapping on biopics data
boot_results <- ___(___, statistic = ___, R = ___)
# Print and plot bootstrapping results
___
___