Sample size in randomization distribution
We've created two new datasets for you with essentially the same difference in proportions as the original discrimination data. However, one of the datasets (disc_small
) is one third the size of the original dataset and the other (disc_big
) is 10 times larger than the original dataset.
Additionally, the same permutation code used previously has been run on the small and big datasets to create small and big distributions of permuted differences in promotion rates (disc_small_perm
and disc_big_perm
, respectively).
In this exercise, you'll use these two new distributions to get a sense for how the differences vary given widely different sample sizes. In particular, notice the range of variability on the x-axis of each plot.
Cet exercice fait partie du cours
Foundations of Inference in R
Exercice interactif pratique
Essayez cet exercice en complétant cet exemple de code.
# Tabulate the small dataset
disc_small %>%
# Select sex and promote
___(___, ___)
# Do the same for disc_big
___