Exploratory data analysis
The first step of any data analysis, unsupervised or supervised, is to familiarize yourself with the data.
The variables you created before, wisc.data and diagnosis, are still available in your workspace. Explore the data to answer the following questions:
- How many observations are in this dataset?
- How many variables/features in the data are suffixed with _mean?
- How many of the observations have a malignant diagnosis?
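
One possible way to approach these three questions is sketched below. So that it runs on its own, it builds small toy stand-ins for wisc.data and diagnosis; the real objects already exist in your workspace, and the toy values, the column names, and the 1 = malignant coding are assumptions made for illustration, not taken from the actual dataset. The helper names n_obs, n_mean, and n_malignant are likewise hypothetical.

```r
# Toy stand-ins (assumption): in the exercise, wisc.data and diagnosis are
# already in the workspace, so these two assignments would be skipped.
wisc.data <- matrix(
  1:12,
  nrow = 3,
  dimnames = list(NULL, c("radius_mean", "texture_mean", "radius_se", "radius_worst"))
)
diagnosis <- c(1, 1, 0)  # assumed coding: 1 = malignant, 0 = benign

# Observations are rows, so count the rows of the matrix.
n_obs <- nrow(wisc.data)

# Count the column names that end in "_mean"; the "$" anchors the match
# to the end of the name.
n_mean <- sum(grepl("_mean$", colnames(wisc.data)))

# Count the observations flagged as malignant.
n_malignant <- sum(diagnosis == 1)

print(c(observations = n_obs, mean_features = n_mean, malignant = n_malignant))
```

If diagnosis in your workspace is stored differently (for example as a character vector of "M" and "B"), the comparison in the last step would need to change accordingly; table(diagnosis) is a quick way to check how it is coded.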
This exercise is part of the course
Unsupervised Learning in R