Get startedGet started for free

Linear regression algorithm

To truly understand linear regression, it is helpful to know how the algorithm works. The code for lm() is hundreds of lines because it has to work with any formula and any dataset. However, in the case of simple linear regression for a single dataset, you can implement a linear regression algorithm in just a few lines of code.

The workflow is

  1. Write a script to calculate the sum of squares.
  2. Turn this into a function.
  3. Use R's general purpose optimization function find the coefficients that minimize this.

The explanatory values (the n_convenience column of taiwan_real_estate) are available as x_actual. The response values (the price_twd_msq column of taiwan_real_estate) are available as y_actual.

This exercise is part of the course

Intermediate Regression in R

View Course

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Set the intercept to 10
intercept <- ___

# Set the slope to 1
slope <- ___

# Calculate the predicted y values
y_pred <- ___

# Calculate the differences between actual and predicted
y_diff <- ___

# Calculate the sum of squares
___
Edit and Run Code