ComenzarEmpieza gratis

Create one holdout set

Your boss has asked you to create a simple random forest model on the tic_tac_toe dataset. She doesn't want you to spend much time selecting parameters; rather she wants to know how well the model will perform on future data. For future Tic-Tac-Toe games, it would be nice to know if your model can predict which player will win.

The dataset tic_tac_toe has been loaded for your use.

Note that in Python, =\ indicates the code was too long for one line and has been split across two lines.

Este ejercicio forma parte del curso

Model Validation in Python

Ver curso

Instrucciones del ejercicio

  • Create the X dataset by creating dummy variables for all of the categorical columns.
  • Split X and y into train (X_train, y_train) and test (X_test, y_test) datasets.
  • Split the datasets using 10% for testing

Ejercicio interactivo práctico

Prueba este ejercicio completando el código de muestra.

# Create dummy variables using pandas
X = ____.____(tic_tac_toe.iloc[:,0:9])
y = tic_tac_toe.iloc[:, 9]

# Create training and testing datasets. Use 10% for the test set
____, ____, ____, ____  = ____(X, y, ____=____, random_state=1111)
Editar y ejecutar código