Aan de slagGa gratis aan de slag

Detecting outliers with IForest

IForest is a robust estimator and only requires a few lines of code to detect outliers from any dataset. You may find that this syntax looks familiar since it closely resembles sklearn syntax.

The full version of the Big Mart Sales data has been loaded for you as big_mart, which you can explore in the console.

Deze oefening maakt deel uit van de cursus

Anomaly Detection in Python

Cursus bekijken

Oefeninstructies

  • Import the IForest estimator from pyod.
  • Initialize an IForest() with default parameters.
  • Fit the estimator and generate predictions on the big_mart simultaneously, and store the results in labels.
  • Use pandas subsetting to filter out the outliers from big_mart.

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Import IForest from pyod
from pyod.____ import ____

# Initialize an instance with default parameters
iforest = ____

# Generate outlier labels
labels = ____

# Filter big_mart for outliers
outliers = ____

print(outliers.shape)
Code bewerken en uitvoeren