Class imbalance
In the volunteer
dataset, you're thinking about trying to predict the category_desc
variable using the other features in the dataset. First, though, you need to know what the class distribution (and imbalance) is for that label.
Which descriptions occur less than 50 times in the volunteer
dataset?
Diese Übung ist Teil des Kurses
Preprocessing for Machine Learning in Python
Interaktive Übung
Setze die Theorie in einer unserer interaktiven Übungen in die Praxis um
