Evaluate the folds
Now that you have fit 10 models across your 10 folds and calculated the MAE and RMSE of each one, it's time to visualize how large the errors are. Doing so builds an intuition for the out-of-sample error distribution, which helps you assess the quality of your model.
You will plot all these errors as a histogram and display the summary statistics across all folds.
The result of the previous exercise, fits_cv, is pre-loaded.
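For reference, the two error metrics named above are conventionally defined as follows. The notation is an assumption made for illustration and does not come from the course: n is the number of observations in a held-out fold, y_i is an observed value, and \hat{y}_i is the corresponding prediction.

```latex
\mathrm{MAE}  = \frac{1}{n} \sum_{i=1}^{n} \left| y_i - \hat{y}_i \right|
\qquad
\mathrm{RMSE} = \sqrt{ \frac{1}{n} \sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^{2} }
```

Because squaring weights large residuals more heavily, the RMSE of a fold is never smaller than its MAE, so the two metrics typically form two separate clusters in a histogram of the errors.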
This exercise is part of the course
Machine Learning with Tree-Based Models in R
Instructions
- Collect the out-of-sample errors of all models of fits_cv using a single yardstick function and save them as all_errors.
- Create a ggplot2 histogram using the .estimate as the x aesthetic and fill the bars by .metric.
- Use the same function as in the first instruction with summarize = TRUE to display summary statistics of fits_cv.
Hands-on interactive exercise
Try this exercise by completing this sample code.
library(ggplot2)
# Collect the errors
all_errors <- ___(___, summarize = ___)
# Plot an error histogram
ggplot(___, aes(___, ___)) +
  ___()
# Collect and print error statistics
___(fits_cv, ___)