Calculating the sample mean
The late_shipments dataset contains supply chain data on the delivery of medical supplies. Each row represents one delivery of a part. The late column denotes whether or not the part was delivered late. A value of "Yes" means that the part was delivered late, and a value of "No" means the part was delivered on time.
You'll begin your analysis by calculating a point estimate (or sample statistic), namely the proportion of late shipments.
In pandas, a value's proportion in a categorical DataFrame column can be quickly calculated using the syntax:
prop = (df['col'] == val).mean()
late_shipments is available, and pandas is loaded as pd.
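As a rough illustration of the proportion syntax, the sketch below applies it to a tiny made-up DataFrame. The name toy_shipments and its eight values are hypothetical stand-ins, not rows from late_shipments; only the "late" column name and the "Yes"/"No" coding follow the description above.

```python
import pandas as pd

# Hypothetical toy data standing in for late_shipments (values are made up)
toy_shipments = pd.DataFrame(
    {"late": ["Yes", "No", "No", "No", "Yes", "No", "No", "No"]}
)

# Comparing the column to "Yes" yields a boolean Series (True where late).
# The mean of a boolean Series is the fraction of True values.
late_prop_samp = (toy_shipments["late"] == "Yes").mean()

print(late_prop_samp)  # 2 of 8 rows are "Yes", so this prints 0.25
```

The comparison works because pandas treats True as 1 and False as 0 when averaging, so the mean of the boolean Series equals the share of rows matching the value.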
This exercise is part of the course Hypothesis Testing in Python.
Have a go at this exercise by completing this sample code.
# Print the late_shipments dataset
____