Exercise

Class imbalance

In the volunteer dataset, we want to predict the category_desc variable using the other features in the dataset. First, though, we need to know the class distribution (and any imbalance) for that label.

Which descriptions occur fewer than 50 times in the volunteer dataset?

  • The dataset volunteer has been provided.
  • The column you want to check is category_desc.
  • Use the value_counts() method to check variable counts.
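
As a rough sketch of the check described in the hints, the snippet below counts each label with value_counts() and keeps only the labels that appear fewer than 50 times. The real volunteer data is not reproduced here, so the DataFrame is a hypothetical stand-in: the labels "A" through "D" and their counts are made up purely for illustration.

```python
import pandas as pd

# Hypothetical stand-in for the volunteer dataset. The labels and their
# frequencies are invented for illustration, not taken from the real data.
volunteer = pd.DataFrame(
    {"category_desc": ["A"] * 60 + ["B"] * 55 + ["C"] * 10 + ["D"] * 3}
)

# Count how often each label occurs (sorted from most to least frequent).
counts = volunteer["category_desc"].value_counts()

# Keep only the labels that occur fewer than 50 times.
rare = counts[counts < 50]

print(counts)
print(rare)
```

With the real dataset, the same two lines (value_counts() followed by the boolean filter) should list the under-represented categories directly.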
Instructions
Possible Answers