Feature exploration
Using the same Avazu dataset, you will explore how the values of device_type and banner_pos are distributed, as well as how CTR varies based on them.
Sample data in DataFrame form is loaded as df. The X and y variables that you created in the last exercise are available in your workspace. pandas as pd are also available in your workspace.
Cet exercice fait partie du cours
Predicting CTR with Machine Learning in Python
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# Distribution of values for device type
print("Distribution of device type: ")
print(X.device_type.____()/len(X))
# Sample CTR by device type
print("CTR by device type: ")
print(df.____('device_type')['click'].____/len(y))