Univariate drift detection for hotel booking dataset
In the previous exercises, we established using the multivariate drift detection method that the shift in data in January is responsible for the alert in the ROC AUC metric and the negative business value of the model.
In this exercise, you will use a univariate drift detection method to find the feature and explanation behind the drift.
The reference
and analysis
sets are already pre-loaded for you.
Este ejercicio forma parte del curso
Monitoring Machine Learning in Python
Instrucciones del ejercicio
- Specify Wasserstein and Jensen-Shannon method for continuous methods and L-inifity and Chi2 for categorical.
- Fit the reference and calculate results on the analysis set.
- Plot the results.
Ejercicio interactivo práctico
Prueba este ejercicio y completa el código de muestra.
# Intialize the univariate drift calculator
uv_calc = nannyml.UnivariateDriftCalculator(
column_names=feature_column_names,
timestamp_column_name='timestamp',
chunk_period='m',
continuous_methods=[____, ____],
categorical_methods=[____, ____],
)
# Plot the results
uv_calc.____(reference)
uv_results = uv_calc.____(analysis)
____.____().____()