1
Introduction to Hypothesis Testing
Free
Learn why hypothesis testing is useful, and step through the workflow for a one sample proportion test. In doing so, you'll encounter important concepts like z-scores, p-p-values, and false negative and false positive errors. The Stack Overflow survey and late medical shipments datasets are introduced.
2
Two-Sample and ANOVA Tests
Learn how to test for differences in means between two groups using t-tests, and how to extend this to more than two groups using ANOVA and pairwise t-tests.
3
Proportion Tests
Learn how to test for differences in proportions between two groups using proportion tests, extended it to more than two groups with chi-square independence tests, and return to the one sample case with chi-square goodness of fit tests.
4
Non-Parametric Tests
Learn about the assumptions made by parametric hypothesis tests and see how simulation-based and rank-based non-parametric tests can be used when those assumptions aren't met.

Initializing

Simulation-based t-test

In Chapter 2 you manually performed the steps for a t-test to explore these hypotheses.

\(H_{0}\): The mean weight of shipments that weren't late is the same as the mean weight of shipments that were late.

\(H_{A}\): The mean weight of shipments that weren't late is less than the mean weight of shipments that were late.

You can run the test more concisely using infer's t_test().

late_shipments %>% 
  t_test(
    weight_kilograms ~ late,
    order = c("No", "Yes"),
    alternative = "less"
  )

t_test() assumes that the null distribution is normal. We can avoid assumptions by using a simulation-based non-parametric equivalent.

late_shipments is available; dplyr and infer are loaded.

Specify weight in kilograms versus whether or not the shipment was late.
Declare a null hypothesis of independence.
Generate 1000 permutation replicates.
Calculate the difference in means, setting the order as "No" minus "Yes".