CommencerCommencer gratuitement

A timeline compliant population

Assume that you want to construct a basetable for a predictive model that predicts whether donors will donate in 2018. The timeline indicates that the population should contain all donors that donated at least once since January 1st 2013, but made no donations after January 1st 2017. Given is a pandas dataframe gifts with all the donations made since 2010. In this exercise, you will construct a set with the donor ids of all donors in the population.

Cet exercice fait partie du cours

Intermediate Predictive Analytics in Python

Afficher le cours

Instructions

  • Construct a dataframe gifts_include containing all gifts made in 2013 or later and a dataframe gifts_exclude containing all gifts made in 2017 or later.
  • Construct a set donors_include containing all donor ids of donors in gifts_include and a set donors_exclude containing all donor ids of donors in gifts_exclude.
  • Construct the population using the .difference() method on your two sets.

Exercice interactif pratique

Essayez cet exercice en complétant cet exemple de code.

# Gifts made in 2013 or later
gifts_include = ____[____[____].dt.year >= ____]

# Gifts made in 2017 or later
gifts_exclude = ____[____[____].dt.year >= ____]

# Set with ids in gifts_include
donors_include = ____(____[____])

# Set with ids in gifts_exclude
donors_exclude = ____(____[____])

# Population
population = ____.difference(____)
print(len(population))
Modifier et exécuter le code