Aan de slagGa gratis aan de slag

Visualizing drifting features

After ranking the univariate results, you know that drift hotel and country features are impacting the model's performance the most. In this exercise, you will look at the drift results and distribution plots of them to determine the root cause of the problem.

The results from the univariate drift calculator are stored in the uv_results variable.

Deze oefening maakt deel uit van de cursus

Monitoring Machine Learning in Python

Cursus bekijken

Oefeninstructies

  • Set period argument to analysis for drift_results.
  • Pass hotel and country to column_names for drift_results.
  • Set kind argument in .plot() method to "drift".
  • Do the same for distribution_results, except for setting the kind argument in .plot() method to "distribution".

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Filter and create drift plots
drift_results = uv_results.filter(
    period=____,
    column_names=[____, ____]
    ).plot(kind=____)

# Filter and create distribution plots
distribution_results = uv_results.filter(
    period=____,
    column_names=[____, ____]
    ).plot(kind=____)

# Show the plots
drift_results.show()
distribution_results.show()
Code bewerken en uitvoeren