Grouping & aggregating
You'll be using .groupby()
and .agg()
a lot in this course, so it's important to become comfortable with them. In this exercise, your job is to calculate a set of summary statistics about the purchase data broken out by 'device'
(Android or iOS) and 'gender'
(Male or Female).
Following this, you'll compare the values across these subsets, which will give you a baseline for these values as potential KPIs to optimize going forward.
The purchase_data
DataFrame from the previous exercise has been pre-loaded for you. As a reminder, it contains purchases merged with user demographics.
Este exercício faz parte do curso
Customer Analytics and A/B Testing in Python
Instruções do exercício
- Group the
purchase_data
DataFrame by'device'
and'gender'
in that order. - Aggregate
grouped_purchase_data
, finding the'mean'
,'median'
, and the standard deviation ('std'
) of the purchase price, in that order, across these groups. - Examine the results. Does the mean differ drastically from the median? How much variability is in each group?
Exercício interativo prático
Experimente este exercício completando este código de exemplo.
# Group the data
grouped_purchase_data = purchase_data.____(____ = ['____', '____'])
# Aggregate the data
purchase_summary = grouped_purchase_data.____({'price': ['____', '____', '____']})
# Examine the results
print(purchase_summary)