Exploratory data analysis
The first step of any data analysis, unsupervised or supervised, is to familiarize yourself with the data.
The variables you created before, wisc.data and diagnosis, are still available in your workspace. Explore the data to answer the following questions:
- How many observations are in this dataset?
- How many variables/features in the data are suffixed with
_mean? - How many of the observations have a malignant diagnosis?
Deze oefening maakt deel uit van de cursus
Unsupervised Learning in R
Praktische interactieve oefening
Zet theorie om in actie met een van onze interactieve oefeningen.
Begin met trainen