Selecting the proportion of variance to keep
You'll let PCA determine the number of components to calculate based on an explained variance threshold that you decide.
You'll work on the numeric ANSUR female dataset, pre-loaded as ansur_df.
All relevant packages and classes have been pre-loaded too (Pipeline(), StandardScaler(), PCA()).
This exercise is part of the course Dimensionality Reduction in Python.
Interactive exercise
Complete the sample code to finish this exercise successfully.
# Pipe a scaler to PCA selecting 80% of the variance
pipe = ____([('scaler', ____),
                 ('reducer', ____)])
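
One way the blanks might be filled in is sketched below. It relies on scikit-learn's documented behavior that a float n_components between 0 and 1 makes PCA keep the smallest number of components whose cumulative explained variance exceeds that fraction. Because ansur_df is only available inside the exercise environment, the sketch uses a synthetic array X as a stand-in; X, latent, mixing, n_kept and explained are illustrative names, not part of the exercise.

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA

# Assumption: synthetic stand-in for ansur_df, which is not available here.
# Ten correlated features are generated from three latent factors plus noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 3))
mixing = rng.normal(size=(3, 10))
X = latent @ mixing + 0.1 * rng.normal(size=(500, 10))

# Pipe a scaler to PCA selecting 80% of the variance
pipe = Pipeline([('scaler', StandardScaler()),
                 ('reducer', PCA(n_components=0.8))])

# Fit the pipeline to the data
pipe.fit(X)

# Inspect how many components PCA kept and how much variance they explain
reducer = pipe.named_steps['reducer']
n_kept = reducer.n_components_
explained = reducer.explained_variance_ratio_.sum()
print(f"{n_kept} components selected, explaining {explained:.1%} of the variance")
```

In the exercise itself, pipe.fit(ansur_df) would take the place of pipe.fit(X). Scaling before PCA matters here because the threshold is applied to variance, so unscaled features with large units would otherwise dominate the selection.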