1. Learn
  2. /
  3. Courses
  4. /
  5. Dimensionality Reduction in Python

Exercise

PCA in a model pipeline

We just saw that legendary Pokemon tend to have higher stats overall. Let's see if we can add a classifier to our pipeline that detects legendary versus non-legendary Pokemon based on the principal components.

The data has been pre-loaded for you and split into training and tests datasets: X_train, X_test, y_train, y_test.

Same goes for all relevant packages and classes(Pipeline(), StandardScaler(), PCA(), RandomForestClassifier()).

Instructions 1/4

undefined XP
    1
    2
    3
    4
  • Add a scaler, PCA limited to 2 components, and random forest classifier with random_state=0 to the pipeline.