Train/test split
To evaluate a model fairly, you need to build it on one part of the data and test it on a different part. Otherwise, it's like cheating on an exam, because you already know the answers.
The data split is an integral part of the modeling process. You will dive into this by splitting the diabetes data and confirming the split proportions.
The diabetes data from the last exercise is pre-loaded in your workspace.
This exercise is part of the course
Machine Learning with Tree-Based Models in R
Interactive exercise
Try this exercise by completing this sample code.
# Create the split
diabetes_split <- ___(___, prop = ___)
# Print the data split
___
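
To make the idea of splitting and then confirming the proportions concrete, here is a minimal base R sketch. It is not the solution to the template above: it deliberately uses plain `sample()` instead of whatever split function the exercise expects. The data frame `toy_data`, its columns, and the 0.8 proportion are all hypothetical stand-ins, since the real diabetes data is only available in the course workspace.

```r
# Hypothetical stand-in for the pre-loaded diabetes data (values are made up).
set.seed(42)
toy_data <- data.frame(
  outcome = factor(sample(c("yes", "no"), 100, replace = TRUE)),
  glucose = round(rnorm(100, mean = 120, sd = 30))
)

# Assumed training proportion; the exercise may ask for a different value.
prop <- 0.8
n <- nrow(toy_data)

# Randomly pick the row indices that go into the training set.
train_idx <- sample(seq_len(n), size = floor(prop * n))

# Rows not chosen for training form the test set, so the two never overlap.
train <- toy_data[train_idx, ]
test  <- toy_data[-train_idx, ]

# Confirm the split proportions.
train_share <- nrow(train) / n
test_share  <- nrow(test) / n
print(c(train = train_share, test = test_share))
```

Because the test rows are taken as the complement of the training indices, every observation lands in exactly one of the two sets, and the two shares always sum to 1.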