Discretization of a certain variable
In order to make predictor insight graphs for continuous variables, you first need to discretize them. In Python, you can discretize pandas columns using the qcut
method.
To check whether the variable was nicely discretized, you can verify that the bins have equal size using the groupby
method:
print(basetable.groupby("discretized_variable").size()
Este ejercicio forma parte del curso
Introduction to Predictive Analytics in Python
Instrucciones del ejercicio
- Use the method
qcut
to discretize the variabletime_since_last_donation
in 10 groups. Assing this variable to a new column called "bins_recency". - Use the method
groupby
to verify that the bins have about equal size.
Ejercicio interactivo práctico
Prueba este ejercicio completando el código de muestra.
# Discretize the variable time_since_last_donation in 10 bins
basetable["bins_recency"] = pd.qcut(____,____)
# Print the group sizes of the discretized variable
print(basetable.groupby("____").size())