Class imbalance
In the volunteer dataset, you're thinking about trying to predict the category_desc variable using the other features in the dataset. First, though, you need to know what the class distribution (and imbalance) is for that label.
Which descriptions occur fewer than 50 times in the volunteer dataset?
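One way to check this is to count how often each label occurs and keep only the rare ones. The sketch below assumes the data is loaded into a pandas DataFrame named volunteer. The actual data is not shown here, so the frame and its category names ("Category A" and so on) are made-up placeholders, not the real labels or counts.

```python
import pandas as pd

# Hypothetical stand-in for the volunteer dataset: the labels and their
# frequencies below are invented purely for illustration.
volunteer = pd.DataFrame({
    "category_desc": (
        ["Category A"] * 120
        + ["Category B"] * 60
        + ["Category C"] * 30
        + ["Category D"] * 5
        + [None] * 3  # a few missing labels
    )
})

# Count occurrences of each label, sorted from most to least frequent.
# Missing values are excluded by default (dropna=True).
counts = volunteer["category_desc"].value_counts()

# Keep only the labels that occur fewer than 50 times.
rare = counts[counts < 50]
print(rare)
```

With the real dataset, only the two lines that compute counts and rare would be needed. Passing dropna=False to value_counts() would also show how many rows have no label at all.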
This exercise is part of the course Preprocessing for Machine Learning in Python.
