Making clean cuts
The qcut
method divides the variable in n_bins
equal bins. In some cases, however, it is nice to choose your own bins. The method cut
in python allows you to choose your own bins.
This exercise is part of the course
Introduction to Predictive Analytics in Python
Exercise instructions
- Discretize the variable
number_gift
in three bins with borders 0 and 5, 5 and 10, 10 and 20 and assign this variable to a new column calleddisc_number_gift
. - Count the number of observations in each group.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Discretize the variable
basetable["disc_number_gift"] = pd.cut(____[____],[____, ____, ____, ____])
# Count the number of observations per group
print(basetable.groupby("____").____())