Feature importances
Although some candy attributes, such as chocolate, may be extremely popular, it doesn't mean they will be important to model prediction. After a random forest model has been fit, you can review the model's attribute, .feature_importances_
, to see which variables had the biggest impact. You can check how important each variable was in the model by looping over the feature importance array using enumerate()
.
If you are unfamiliar with Python's enumerate()
function, it can loop over a list while also creating an automatic counter.
Diese Übung ist Teil des Kurses
Model Validation in Python
Anleitung zur Übung
- Loop through the feature importance output of
rfr
. - Print the column names of
X_train
and the importance score for that column.
Interaktive Übung
Vervollständige den Beispielcode, um diese Übung erfolgreich abzuschließen.
# Fit the model using X and y
rfr.fit(X_train, y_train)
# Print how important each column is to the model
for i, item in enumerate(rfr.____):
# Use i and item to print out the feature importance of each column
print("{0:s}: {1:.2f}".format(X_train.columns[____], ____))