Exploratory data analysis
The first step of any data analysis, unsupervised or supervised, is to familiarize yourself with the data.
The variables you created before, wisc.data and diagnosis, are still available in your workspace. Explore the data to answer the following questions:
- How many observations are in this dataset?
- How many variables/features in the data are suffixed with
_mean? - How many of the observations have a malignant diagnosis?
Cet exercice fait partie du cours
Unsupervised Learning in R
Exercice interactif pratique
Passez de la théorie à la pratique avec l’un de nos exercices interactifs
Commencer l’exercice