Selecting the proportion of variance to keep
You'll let PCA determine the number of components to calculate based on an explained variance threshold that you decide.
You'll work on the numeric ANSUR female dataset pre-loaded as ansur_df
.
All relevant packages and classes have been pre-loaded too (Pipeline()
, StandardScaler()
, PCA()
).
Este ejercicio forma parte del curso
Dimensionality Reduction in Python
Ejercicio interactivo práctico
Prueba este ejercicio y completa el código de muestra.
# Pipe a scaler to PCA selecting 80% of the variance
pipe = ____([('scaler', ____),
('reducer', ____)])