Scaling Data Before PCA
When dealing with data that has features with different scales, it's often important to scale the data first. This is because data that has larger values may sway the data even with relatively little variability.
The combine
data frame is loaded for you.
Cet exercice fait partie du cours
Linear Algebra for Data Science in R
Instructions
- Use the
scale()
function to scale the 5th through the 12th columns ofcombine
data. Name this data frameB
and show some of the values usinghead()
. - Use
prcomp()
to perform principal component analysis on the data and summarize this analysis usingsummary()
.
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# Scale columns 5-12 of combine
B <- ___(___[, 5:12])
# Print the first 6 rows of the data
___
# Summarize the principal component analysis
summary(____(B))