
Build a random forest model

Here you will use the same cross-validation data to build (using the train data) and evaluate (using the validate data) a random forest for each partition. Because you are reusing the cross-validation partitions from your regression models, you can directly compare the performance of the two model types.
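As a reminder of the expected shape of the data, below is a minimal sketch of how cv_data might have been prepared in the earlier exercises; the use of rsample::vfold_cv() and the name training_data are assumptions for illustration, not part of this exercise.

library(rsample)
library(dplyr)
library(purrr)

# Hypothetical setup: 5-fold cross-validation with list columns holding
# the train and validate data frames for each partition
cv_split <- vfold_cv(training_data, v = 5)
cv_data <- cv_split %>% 
  mutate(train = map(splits, ~training(.x)),
         validate = map(splits, ~testing(.x)))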

Note: We will limit our random forests to 100 trees so that they fit in a reasonable amount of time. The default number of trees for ranger() is 500.
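To see the num.trees argument in isolation, here is a small, self-contained sketch using the built-in mtcars data purely for illustration; the formula and data are stand-ins, not the exercise data.

library(ranger)

# Fit a regression forest with 100 trees instead of the default 500
rf_fit <- ranger(formula = mpg ~ ., data = mtcars, num.trees = 100, seed = 42)

# Out-of-bag prediction error (mean squared error for regression forests)
rf_fit$prediction.error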

This exercise is part of the course Machine Learning in the Tidyverse.


Exercise instructions

  • Use ranger() to build a random forest predicting life_expectancy from all features in train for each cross-validation partition.
  • Add a new column, validate_predicted, containing the life_expectancy predictions for the observations in validate made with the random forest models you just created.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

library(ranger)

# Build a random forest model for each fold
cv_models_rf <- cv_data %>% 
  mutate(model = map(___, ~ranger(formula = ___, data = ___,
                                    num.trees = 100, seed = 42)))

# Generate predictions using the random forest model
cv_prep_rf <- cv_models_rf %>% 
  mutate(validate_predicted = map2(.x = ___, .y = ___, ~predict(.x, .y)$predictions))
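If you want to check your work, the completed calls might look like the following. This is a sketch that assumes cv_data contains list columns train and validate holding data frames that include a life_expectancy column, as set up in the earlier cross-validation exercises.

library(ranger)
library(dplyr)
library(purrr)

# Build a random forest for each fold, predicting life_expectancy from all
# other features in that fold's training data (100 trees, fixed seed)
cv_models_rf <- cv_data %>% 
  mutate(model = map(train, ~ranger(formula = life_expectancy ~ .,
                                    data = .x, num.trees = 100, seed = 42)))

# Predict life_expectancy for each fold's validate data with its own model
cv_prep_rf <- cv_models_rf %>% 
  mutate(validate_predicted = map2(.x = model, .y = validate,
                                   ~predict(.x, .y)$predictions))

From here, the values in validate_predicted can be compared with the actual life_expectancy values in each validate data frame to compute a validation error, mirroring the workflow you used for the regression models.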