MulaiMulai sekarang secara gratis

Lazy train-test split

You have transformed the X variables. Now you need to finish your data prep by transforming the y variables and splitting your data into train and test sets.

The variables X and y, which you created in the last exercise, are available in your environment.

Latihan ini adalah bagian dari kursus

Parallel Programming with Dask in Python

Lihat Kursus

Petunjuk latihan

  • Import the train_test_split() function from dask_ml.model_selection.
  • The popularity scores in y are in the range 0-100, divide them by 100 so they are in the range 0-1.
  • Split the data into train and test sets using the train_test_split() function, make sure to shuffle the data, and set the test fraction to 20% of the data.

Latihan interaktif praktis

Cobalah latihan ini dengan menyelesaikan kode contoh berikut.

# Import the train_test_split function
from ____ import ____

# Rescale the target values
y = ____

# Split the data into train and test sets
X_train, X_test, y_train, y_test = ____

print(X_train)
Edit dan Jalankan Kode