Create one holdout set

Your boss has asked you to create a simple random forest model on the tic_tac_toe dataset. She doesn't want you to spend much time selecting parameters; rather, she wants to know how well the model will perform on future data. For future Tic-Tac-Toe games, it would be nice to know whether your model can predict which player will win.

The dataset tic_tac_toe has been loaded for your use.

Note that in Python, a backslash at the end of a line (as in =\) indicates that the statement was too long for one line and continues on the next line.
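As a small illustration of the line-continuation note above, the sketch below splits one assignment across two lines. The variable names and the use of divmod are only illustrative and are not part of the exercise.

```python
# The backslash after "=" tells Python the statement continues on the next line.
quotient, remainder =\
    divmod(17, 5)

# The same statement written on a single line, for comparison.
same_quotient, same_remainder = divmod(17, 5)

print(quotient, remainder)
```

Both forms are the same statement; the backslash only changes how the code is laid out.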

This exercise is part of the course

Model Validation in Python

Instructions

  • Create the X dataset by creating dummy variables for all of the categorical columns.
  • Split X and y into train (X_train, y_train) and test (X_test, y_test) datasets.
  • Use 10% of the data for the test set.
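The steps above can be sketched on a small made-up table. This is only an illustration under stated assumptions: the toy DataFrame, its column names (cell_0 to cell_8, Class) and its values are hypothetical stand-ins, not the real tic_tac_toe data. It assumes pandas and scikit-learn are installed.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical stand-in data: nine categorical board columns plus one outcome column.
# The real tic_tac_toe dataset may use different column names and values.
marks = ["x", "o", "b"]
rows = [
    [marks[(i + j) % 3] for j in range(9)] + ["positive" if i % 2 == 0 else "negative"]
    for i in range(20)
]
columns = [f"cell_{j}" for j in range(9)] + ["Class"]
toy = pd.DataFrame(rows, columns=columns)

# One-hot encode the nine categorical feature columns.
X = pd.get_dummies(toy.iloc[:, 0:9])
# The last column is the target.
y = toy.iloc[:, 9]

# Hold out 10% of the rows as a test set; random_state makes the split repeatable.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.1, random_state=1111
)

print(X.shape, X_train.shape, X_test.shape)
```

Fixing random_state means the same rows land in the holdout set on every run, so scores from different models stay comparable.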

Hands-on interactive exercise

Try this exercise by completing this sample code.

# Create dummy variables using pandas
X = ____.____(tic_tac_toe.iloc[:,0:9])
y = tic_tac_toe.iloc[:, 9]

# Create training and testing datasets. Use 10% for the test set
____, ____, ____, ____ = ____(X, y, ____=____, random_state=1111)