Initial exploration of the data
You are presented with data from the Occupational Employment Statistics (OES) program which produces employment and wage estimates annually. This data contains the yearly average income from 2001 to 2016 for 22 occupation groups. You would like to use this data to identify clusters of occupations that maintained similar income trends.
The data is stored in your environment as the data.matrix oes
.
Before you begin to cluster this data you should determine whether any pre-processing steps (such as scaling and imputation) are necessary.
Leverage the functions head()
and summary()
to explore the oes
data in order to determine which of the pre-processing steps below are necessary:
This exercise is part of the course
Cluster Analysis in R
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
