Summary statistics
Recall from the previous lesson that NannyML provides five methods for tracking statistical changes in your features.
In this exercise, you will focus on examining the lead_time feature from the Hotel Booking dataset, which indicates how many days in advance a booking was made. By using summation, median, and standard deviation statistics, you can gain valuable insights into how customer booking behavior has evolved over time.
It's important to note that both the reference and analysis sets, as well as the nannyml library, are already pre-loaded and ready for use.
Questo esercizio fa parte del corso
Monitoring Machine Learning in Python
Esercizio pratico interattivo
Prova a risolvere questo esercizio completando il codice di esempio.
# Define analyzed column
analyzed_column = [____]
# Intialize sum values calculator
sum_calc = ____.____(
____=____,
chunk_period='m',
timestamp_column_name='timestamp'
)
# Fit, calculate and plot the results
sum_calc.fit(reference)
sum_calc_res = sum_calc.calculate(analysis)
sum_calc_res.plot().show()