Isolation forest with factors
As you saw in the video, an isolation forest can accept categorical features as input, but only if they are encoded as factor variables.
In this exercise, the thyroid data you edited in the previous exercise is preloaded. To be extra careful, you should first check that all of the features are numeric or factor before attempting to train an isolation forest.
This exercise is part of the course
Introduction to Anomaly Detection in R
Exercise instructions
- Check that the
ageandsexcolumns in thethyroiddata are encoded as factors. - Then train an isolation forest with 100 trees.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Check the class of age column
___
# Check the class of sex column
___
# Fit an isolation forest with 100 trees
thyroid_for <- ___