Exploratory data analysis
The first step of any data analysis, unsupervised or supervised, is to familiarize yourself with the data.
The variables you created before, wisc.data
and diagnosis
, are still available in your workspace. Explore the data to answer the following questions:
- How many observations are in this dataset?
- How many variables/features in the data are suffixed with
_mean
? - How many of the observations have a malignant diagnosis?
This exercise is part of the course
Unsupervised Learning in R
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
