1. A/B testing
Welcome back! Let's apply what you have learned so far to A/B testing!
2. A/B testing
A/B testing is a method for assessing user experience based on a randomized experiment, in which we divide our users into two groups.
3. A/B testing
We expose each group to a different version of something, for instance, we show each group a different version of a website layout.
4. A/B testing
Then, we compare the two groups on some metric, such as which website version generated a higher click-through rate. Such an A/B test allows us to pick the better version of the website, which is then shown to all users.
5. A/B testing: frequentist way
The typical, frequentist approach to A/B testing is based on a statistical procedure known as hypothesis testing. The main drawback of this approach is that we can only conclude which group is better, but not how much better it is.
6. A/B testing: Bayesian approach
Alternatively, the Bayesian approach allows us to calculate the posterior click-through rates for websites A and B, compare them directly, and calculate the probability that one is better than the other. We can also quantify how much better it is, and even estimate the expected loss in case we make a wrong decision and deploy the worse website version.
7. A/B testing: Bayesian approach
You already know how to set up Bayesian A/B testing. To model whether a user clicks or doesn't click, you can use the binomial distribution, with a click being a success and the click rate being the probability of success.
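A minimal sketch of this model, assuming numpy and a purely hypothetical true click rate, could look like this:

    import numpy as np

    # Hypothetical true click rate, for illustration only
    true_click_rate = 0.15

    # Each visit is one binomial trial: 1 = click (success), 0 = no click
    clicks = np.random.binomial(n=1, p=true_click_rate, size=1000)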
8. Simulate beta posterior
We've seen that for binomial data, a beta prior generates a beta posterior: starting from a Beta(alpha, beta) prior, observing the data simply updates it to Beta(alpha + number of successes, beta + number of failures). This allows us to sample the posterior draws directly from the appropriate beta distribution.
Here is a custom function that you will use for this, called simulate_beta_posterior. It implements the update formulas above and works just like the get_heads_prob function you have used before. The only difference is that, in addition to the 0-1 data, you pass the two beta prior parameters as arguments. As a result, you get 10000 posterior draws, just like before.
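The course's implementation isn't reproduced here, but based on the update formulas above, a sketch of simulate_beta_posterior, assuming numpy and these parameter names, might look like this:

    import numpy as np

    def simulate_beta_posterior(trials, beta_prior_a, beta_prior_b):
        # Number of successes (clicks) in the 0-1 data
        num_successes = np.sum(trials)
        # Posterior: Beta(prior_a + successes, prior_b + failures)
        posterior_draws = np.random.beta(
            beta_prior_a + num_successes,
            beta_prior_b + len(trials) - num_successes,
            size=10000,
        )
        return posterior_draws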
9. Comparing posteriors
Imagine you have lists of 1s (clicks) and 0s (no-clicks) from the website traffic, one for each of two website layouts: A and B. You can use the simulate_beta_posterior function to simulate posterior draws. Here, we are using a beta-1-1 prior. We can plot the two posteriors to see that B seems to be better, although the two overlap.
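As a sketch, assuming clicks_A and clicks_B are those two 0-1 lists and that seaborn and matplotlib are available for plotting:

    import seaborn as sns
    import matplotlib.pyplot as plt

    # Posterior click rates for each layout, using a Beta(1, 1) prior
    posterior_A = simulate_beta_posterior(clicks_A, 1, 1)
    posterior_B = simulate_beta_posterior(clicks_B, 1, 1)

    # Plot both posteriors to compare them visually
    sns.kdeplot(posterior_A, label="A")
    sns.kdeplot(posterior_B, label="B")
    plt.legend()
    plt.show()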
10. Comparing posteriors
We can subtract the posterior draws for A from those for B to calculate the posterior difference between the click rates. This difference is very likely to be positive, which corresponds to B being better.
To get the explicit probability of B being better than A, we can create a Boolean array that is True when B is better and False otherwise, and compute its mean. Here, there is a 96% probability that the B website layout is better!
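Continuing the sketch (the variable names diff and prob_B_better are assumptions, not from the course):

    # Posterior difference between the click rates of B and A
    diff = posterior_B - posterior_A

    # Probability that B's click rate is higher than A's
    prob_B_better = (diff > 0).mean()
    print(prob_B_better)  # around 0.96 in this example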
11. Expected loss
We can also estimate the expected loss resulting from accidentally deploying a worse version. First, we slice the difference between the two posteriors to take only the rare cases where A is better. This is our loss. Then, we take the average to get the expected loss.
If we deploy version B, which we know is better with 96% probability, but the 4% risk materializes and it turns out A was better, we will only lose 0-point-7 percentage points in the click-through rate.
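Sticking with the same sketch and the diff variable from above:

    # Keep only the rare cases where A is better, i.e. the difference is negative
    loss = diff[diff < 0]

    # Expected loss if we deploy B but A turns out to be better
    expected_loss = loss.mean()
    print(expected_loss)  # a small negative number, about -0.007 here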
12. Ads data
In this chapter, you will work with ads data adapted from Kaggle, which contains information on whether ad banners for different products, displayed on different site versions, were clicked or not.
13. Let's A/B test!
Let's A/B test!