Varying training set size

The size of your training and testing sets influences model performance. Models learn better when they have more training data. However, there's a risk that they overfit to the training data and don't generalize well to new data, so in order to properly evaluate the model's ability to generalize, you need enough testing data. As a result, there is a important balance and trade-off involved between how much you use for training and how much you hold for testing.

So far, you've used 70% for training and 30% for testing. Let's now use 80% of the data for training and evaluate how that changes the model's performance.

Create training and testing sets, with 80% of data used for training and 20% held for testing.

Exploratory Data Analysis

Preprocessing for Churn Modeling

Churn Prediction

Model Tuning

Exercicio

Varying training set size

Instruções 1/3