Making clean cuts
The qcut method divides the variable in n_bins equal bins. In some cases, however, it is nice to choose your own bins. The method cut in python allows you to choose your own bins.
Deze oefening maakt deel uit van de cursus
Introduction to Predictive Analytics in Python
Oefeninstructies
- Discretize the variable
number_giftin three bins with borders 0 and 5, 5 and 10, 10 and 20 and assign this variable to a new column calleddisc_number_gift. - Count the number of observations in each group.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
# Discretize the variable
basetable["disc_number_gift"] = pd.cut(____[____],[____, ____, ____, ____])
# Count the number of observations per group
print(basetable.groupby("____").____())