Feature importances
Although some candy attributes, such as chocolate, may be extremely popular, it doesn't mean they will be important to model prediction. After a random forest model has been fit, you can review the model's attribute, .feature_importances_, to see which variables had the biggest impact. You can check how important each variable was in the model by looping over the feature importance array using enumerate().
If you are unfamiliar with Python's enumerate() function, it can loop over a list while also creating an automatic counter.
Deze oefening maakt deel uit van de cursus
Model Validation in Python
Oefeninstructies
- Loop through the feature importance output of
rfr. - Print the column names of
X_trainand the importance score for that column.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Fit the model using X and y
rfr.fit(X_train, y_train)
# Print how important each column is to the model
for i, item in enumerate(rfr.____):
# Use i and item to print out the feature importance of each column
print("{0:s}: {1:.2f}".format(X_train.columns[____], ____))