Train/test split
To evaluate a model fairly, you need to build it on one part of the data and test it on another. Otherwise, it's like cheating on an exam, because you already know the answers.
The data split is an integral part of the modeling process. You will dive into this by splitting the diabetes data and confirming the split proportions.
The diabetes data from the last exercise is pre-loaded in your workspace.
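As a rough illustration of the idea, here is a minimal base R sketch of a random train/test split with a proportion check. It uses the built-in `mtcars` data rather than the diabetes data, and it relies on `sample()` instead of the splitting function this exercise asks for. The 75% proportion and the names `train_idx`, `train`, and `test` are assumptions chosen for the example.

```r
# Illustration only: built-in mtcars data, not the diabetes data.
set.seed(42)                       # make the random split reproducible
n <- nrow(mtcars)                  # 32 rows in total

# Assumed proportion: 75% of the rows go to training
train_idx <- sample(n, size = floor(0.75 * n))

train <- mtcars[train_idx, ]       # rows used to build the model
test  <- mtcars[-train_idx, ]      # held-out rows used to test it

# Confirm the split proportions
nrow(train) / n                    # 0.75
nrow(test) / n                     # 0.25
```

Because the test rows are never seen while the model is built, performance measured on them gives a more honest picture of how the model handles new data.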
This exercise is part of the course
Machine Learning with Tree-Based Models in R
Hands-on interactive exercise
Try this exercise by completing this sample code.
# Create the split
diabetes_split <- ___(___, prop = ___)
# Print the data split
___