A/B test data cleaning

Cleaning A/B test data follows the same process taught in general data cleaning and manipulation courses. However, understanding the context and data logging behavior of each A/B test on a case-by-case basis lets you make informed decisions about how to handle messy data. Duplicates and missing values are two cases that need care: whether you drop or keep them must match your metric definitions and be applied consistently across variants.
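
As a rough illustration of why these choices matter, the sketch below uses a small made-up log, not the AdSmart or homepage data. The DataFrame `logs` and its columns `user_id`, `variant`, and `converted` are hypothetical names chosen for this example. It assumes that fully identical rows are logging repeats and that missing outcomes are excluded from the metric in both variants; a real test may call for different rules.

```python
import numpy as np
import pandas as pd

# Hypothetical toy log: column names and values are invented for illustration
logs = pd.DataFrame({
    "user_id": [1, 1, 2, 3, 4, 4],
    "variant": ["A", "A", "A", "B", "B", "B"],
    "converted": [1, 1, 0, np.nan, 1, 0],
})

# Assumption: fully identical rows are logging repeats, so keep the first copy
deduped = logs.drop_duplicates(keep="first")
print(len(logs), len(deduped))

# Count missing outcomes per variant before deciding to drop or impute them
missing_per_variant = deduped["converted"].isna().groupby(deduped["variant"]).sum()
print(missing_per_variant)

# Conversion rate per variant; mean() skips missing values in both groups,
# so the same rule is applied to A and B
rates = deduped.groupby("variant")["converted"].mean()
print(rates)
```

Note that user 4 appears twice with different outcomes, so those rows are not treated as duplicates here. Whether they count as two events or one user depends on how the metric is defined.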

The AdSmart and homepage DataFrames, along with the pandas and numpy libraries, are loaded for you.

The AdSmart dataset is sourced from Kaggle.

This exercise is part of the course

A/B Testing in Python

Interactive exercise

Try this exercise by completing the sample code below.

# Check for duplicate rows due to logging issues 
print(____(AdSmart))
print(____(AdSmart.____(keep='____')))