Summary statistics
Recall from the previous lesson that NannyML provides five methods for tracking statistical changes in your features.
In this exercise, you will focus on examining the lead_time
feature from the Hotel Booking dataset, which indicates how many days in advance a booking was made. By using summation, median, and standard deviation statistics, you can gain valuable insights into how customer booking behavior has evolved over time.
It's important to note that both the reference and analysis sets, as well as the nannyml
library, are already pre-loaded and ready for use.
Cet exercice fait partie du cours
Monitoring Machine Learning in Python
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# Define analyzed column
analyzed_column = [____]
# Intialize sum values calculator
sum_calc = ____.____(
____=____,
chunk_period='m',
timestamp_column_name='timestamp'
)
# Fit, calculate and plot the results
sum_calc.fit(reference)
sum_calc_res = sum_calc.calculate(analysis)
sum_calc_res.plot().show()