Exercise

PCA in a model pipeline

We just saw that legendary Pokemon tend to have higher stats overall. Let's see if we can add a classifier to our pipeline that detects legendary versus non-legendary Pokemon based on the principal components.

The data has been pre-loaded for you and split into training and test datasets: X_train, X_test, y_train, y_test.

The same goes for all relevant packages and classes (Pipeline(), StandardScaler(), PCA(), RandomForestClassifier()).

Instructions 1/4
  • Add a scaler, a PCA step limited to 2 components, and a random forest classifier with random_state=0 to the pipeline.
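
One way this step could look is sketched below. It is a minimal sketch, not the exercise's reference solution: the step names ("scaler", "reducer", "classifier") are illustrative choices, and because the pre-loaded Pokemon data is not available here, the code builds a synthetic stand-in with make_classification (an assumption: six numeric features and an imbalanced binary target). The fit and score calls go beyond this first instruction and are only there to show the pipeline runs end to end.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# ASSUMPTION: synthetic stand-in for the pre-loaded Pokemon data
# (six numeric features, roughly 10% positive class).
X, y = make_classification(
    n_samples=400,
    n_features=6,
    n_informative=4,
    n_redundant=2,
    weights=[0.9, 0.1],
    random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Scale the features, project onto 2 principal components, then classify.
# The step names are hypothetical labels chosen for this sketch.
pipe = Pipeline([
    ("scaler", StandardScaler()),
    ("reducer", PCA(n_components=2)),
    ("classifier", RandomForestClassifier(random_state=0)),
])

# Fit on the training split and report accuracy on the held-out split.
pipe.fit(X_train, y_train)
accuracy = pipe.score(X_test, y_test)
print(f"Test accuracy: {accuracy:.2f}")
```

Putting the scaler first matters because PCA is sensitive to feature scale: without standardization, the features with the largest variance would dominate the principal components.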