Predict bike rentals on new data
In this exercise, you will use the model you built in the previous exercise to make predictions for the month of August. The dataset bikesAugust
has the same columns as bikesJuly
.
Recall that you must specify type = "response"
with predict()
(docs) when predicting counts from a glm
poisson or quasipoisson model.
The model bike_model
and the bikesAugust
data frame have been pre-loaded.
This exercise is part of the course
Supervised Learning in R: Regression
Exercise instructions
- Use
predict
to predict the number of bikes per hour on thebikesAugust
data. Assign the predictions to the columnbikesAugust$pred
. - Fill in the blanks to get the RMSE of the predictions on the August data.
- Fill in the blanks to generate the plot of predictions to actual counts.
- Do any of the predictions appear negative?
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# bikesAugust is available
str(bikesAugust)
# bike_model is available
summary(bike_model)
# Make predictions on August data
bikesAugust$pred <- ___
# Calculate the RMSE
bikesAugust %>%
mutate(residual = ___) %>%
summarize(rmse = ___)
# Plot predictions vs cnt (pred on x-axis)
ggplot(bikesAugust, aes(x = ___, y = ___)) +
geom_point() +
geom_abline(color = "darkblue")