Generating & calculating
The infer pipeline for hypothesis testing requires four steps to calculate the null distribution: specify, hypothesize, generate, and calculate.
Let's continue the pipeline you began in the previous coding exercise. We'll generate a set of differences in proportions distributed as they would be if the null hypothesis were true, that is, if the proportion of late shipments were the same across freight cost groups.
late_shipments is available; dplyr, infer, and ggplot2 are loaded.
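To make the idea of a null distribution concrete, here is a minimal base-R sketch of the permutation logic. It is a conceptual illustration, not infer's implementation. The toy vectors `late` and `group`, the group labels "expensive" and "reasonable", and the helper `diff_in_props()` are all hypothetical and do not come from the course dataset.

```r
set.seed(1)

# Hypothetical toy data (invented for illustration, not late_shipments):
# 100 shipments per freight cost group, 15 late in one and 5 late in the other.
late  <- c(rep("Yes", 15), rep("No", 85), rep("Yes", 5), rep("No", 95))
group <- rep(c("expensive", "reasonable"), each = 100)

# Hypothetical helper: proportion late in "expensive" minus proportion late
# in "reasonable".
diff_in_props <- function(late, group) {
  mean(late[group == "expensive"] == "Yes") -
    mean(late[group == "reasonable"] == "Yes")
}

# Difference in proportions in the unshuffled data.
observed <- diff_in_props(late, group)

# Shuffling the lateness labels breaks any link with the group, which mimics
# the independence null hypothesis. Repeating this 2000 times gives 2000
# differences in proportions drawn under the null.
null_stats <- replicate(2000, diff_in_props(sample(late), group))

# The null differences should be centered near zero.
summary(null_stats)
```

Comparing `observed` with the spread of `null_stats` is the basis of the permutation test that the infer pipeline automates.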
This exercise is part of the course
Hypothesis Testing in R
Interactive hands-on exercise
Try to solve this exercise by completing the sample code.
# Extend the pipeline to generate 2000 permutations
generated <- late_shipments %>%
  specify(
    late ~ freight_cost_group,
    success = "Yes"
  ) %>%
  hypothesize(null = "independence") %>%
  ___

# See the result
generated
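
One possible way to run all four steps end to end is sketched below on invented data. This is a hedged illustration, not the exercise's official solution. The data frame `toy_shipments` is a hypothetical stand-in for `late_shipments`, and the group labels "expensive" and "reasonable" (and therefore the `order` argument) are assumptions, since the levels of `freight_cost_group` are not shown above.

```r
library(dplyr)
library(infer)

set.seed(2024)

# Hypothetical stand-in for late_shipments (toy data, not the course dataset).
toy_shipments <- data.frame(
  late = factor(c(rep("Yes", 15), rep("No", 85), rep("Yes", 5), rep("No", 95))),
  freight_cost_group = factor(rep(c("expensive", "reasonable"), each = 100))
)

null_distn <- toy_shipments %>%
  # Step 1: declare the response, the explanatory variable, and the success level.
  specify(late ~ freight_cost_group, success = "Yes") %>%
  # Step 2: state the null hypothesis of independence between the two variables.
  hypothesize(null = "independence") %>%
  # Step 3: create 2000 permuted copies of the data.
  generate(reps = 2000, type = "permute") %>%
  # Step 4: compute one difference in proportions per replicate.
  # The order of the groups is an assumption made for this toy data.
  calculate(stat = "diff in props", order = c("expensive", "reasonable"))

# One row per replicate, with the statistic in the stat column.
null_distn
```

Stopping after `generate()` returns the permuted datasets stacked together and labelled by a `replicate` column; adding `calculate()` collapses each replicate to a single statistic.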