LoslegenKostenlos loslegen

Discretization of a certain variable

In order to make predictor insight graphs for continuous variables, you first need to discretize them. In Python, you can discretize pandas columns using the qcut method.

To check whether the variable was nicely discretized, you can verify that the bins have equal size using the groupby method:

print(basetable.groupby("discretized_variable").size()

Diese Übung ist Teil des Kurses

Introduction to Predictive Analytics in Python

Kurs anzeigen

Anleitung zur Übung

  • Use the method qcut to discretize the variable time_since_last_donation in 10 groups. Assing this variable to a new column called "bins_recency".
  • Use the method groupby to verify that the bins have about equal size.

Interaktive Übung

Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.

# Discretize the variable time_since_last_donation in 10 bins
basetable["bins_recency"] = pd.qcut(____,____)

# Print the group sizes of the discretized variable
print(basetable.groupby("____").size())
Code bearbeiten und ausführen