1. 학습
  2. /
  3. 강의
  4. /
  5. Supervised Learning with scikit-learn

Connected

연습 문제

Dropping missing data

Over the next three exercises, you are going to tidy the music_df dataset. You will create a pipeline to impute missing values and build a KNN classifier model, then use it to predict whether a song is of the "Rock" genre.

In this exercise specifically, you will drop missing values accounting for less than 5% of the dataset, and convert the "genre" column into a binary feature.

지침 1/3

undefined XP
    1
    2
    3
  • Print the number of missing values for each column in the music_df dataset, sorted in ascending order.