Isolation forest with factors
As you saw in the video, an isolation forest can accept categorical features as input, but only if they are encoded as factor variables.
In this exercise, the thyroid data you edited in the previous exercise is preloaded. To be extra careful, you should first check that all of the features are numeric or factor before attempting to train an isolation forest.
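The check described above can be sketched in base R. This is a minimal illustration on a made-up data frame: the name toy, its columns, and its values are hypothetical stand-ins and not the actual thyroid data. It lists each column's class, flags any column that is neither numeric nor factor, and converts those columns with as.factor().

```r
# Hypothetical stand-in for the exercise data; names and values are made up.
toy <- data.frame(
  TSH = c(1.2, 0.4, 3.8, 2.1),
  age = c("young", "old", "old", "young"),
  sex = c("F", "M", "F", "F"),
  stringsAsFactors = FALSE
)

# Class of every column at a glance
col_classes <- sapply(toy, class)
print(col_classes)

# Names of columns that are neither numeric nor factor
ok <- sapply(toy, function(x) is.numeric(x) || is.factor(x))
bad_cols <- names(toy)[!ok]

# Convert the offending columns to factors
toy[bad_cols] <- lapply(toy[bad_cols], as.factor)

# TRUE once every column is numeric or factor
all_ok <- all(sapply(toy, function(x) is.numeric(x) || is.factor(x)))
```

For a single column, class(toy$age) or is.factor(toy$age) gives the same information as the column-wise check.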
This exercise is part of the course Introduction to Anomaly Detection in R
Exercise instructions
- Check that the age and sex columns in the thyroid data are encoded as factors.
- Then train an isolation forest with 100 trees.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Check the class of age column
___
# Check the class of sex column
___
# Fit an isolation forest with 100 trees
thyroid_for <- ___