Get startedGet started for free

Calculating the sample mean

The late_shipments dataset contains supply chain data on the delivery of medical supplies. Each row represents one delivery of a part. The late columns denotes whether or not the part was delivered late. A value of "Yes" means that the part was delivered late, and a value of "No" means the part was delivered on time.

You'll begin your analysis by calculating a point estimate (or sample statistic), namely the proportion of late shipments.

In pandas, a value's proportion in a categorical DataFrame column can be quickly calculated using the syntax:

prop = (df['col'] == val).mean()

late_shipments is available, and pandas is loaded as pd.

This exercise is part of the course

Hypothesis Testing in Python

View Course

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Print the late_shipments dataset
____
Edit and Run Code