Linear regression on the United States
A linear regression is a model that lets us examine how one variable changes with respect to another by fitting a best fit line. It is done with the lm()
function in R.
Here, you'll fit a linear regression to just the percentage of "yes" votes from the United States.
This exercise is part of the course
Case Study: Exploratory Data Analysis in R
Exercise instructions
- Print the
US_by_year
data to the console. - Using just the US data in
US_by_year
, uselm()
to run a linear regression predictingpercent_yes
fromyear
. Save this to a variableUS_fit
. - Summarize
US_fit
using thesummary()
function.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Percentage of yes votes from the US by year: US_by_year
US_by_year <- by_year_country %>%
filter(country == "United States")
# Print the US_by_year data
# Perform a linear regression of percent_yes by year: US_fit
# Perform summary() on the US_fit object