Train/test split
In order to test your models, you need to build and test the model on two different parts of the data - otherwise, it's like cheating on an exam (as you already know the answers).
The data split is an integral part of the modeling process. You will dive into this by splitting the diabetes data and confirming the split proportions.
The diabetes data from the last exercise is pre-loaded in your workspace.
Deze oefening maakt deel uit van de cursus
Machine Learning with Tree-Based Models in R
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Create the split
diabetes_split <- ___(___, prop = ___)
# Print the data split
___