Hypothetical population - less variability around the line
In order to understand the sampling distribution associated with the slope coefficient, it is valuable to visualize the impact changes in the sample and population have on the slope coefficient. Here, reducing the variance associated with the response variable around the line changes the variability associated with the slope statistics.
This exercise is part of the course
Inference for Linear Regression in R
Exercise instructions
- Look at the plot that has been drawn for you.
- Swap
popdata
fornew_popdata
in the sampling code, and redraw the plot. - Look at the new plot. How is it different?
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Update the sampling to use new_popdata
many_samples <- popdata %>%
rep_sample_n(size = 50, reps = 100)
# Rerun the plot; how does it change?
ggplot(many_samples, aes(x = explanatory, y = response, group = replicate)) +
geom_point() +
geom_smooth(method = "lm", se = FALSE)