Exploratory data analysis

The first step of any data analysis, unsupervised or supervised, is to familiarize yourself with the data.

The variables you created before, wisc.data and diagnosis, are still available in your workspace. Explore the data to answer the following questions:

  1. How many observations are in this dataset?
  2. How many variables/features in the data are suffixed with _mean?
  3. How many of the observations have a malignant diagnosis?

This exercise is part of the course

Unsupervised Learning in R

View Course

Hands-on interactive exercise

Turn theory into action with one of our interactive exercises

Start Exercise