Get startedGet started for free

Sample size in randomization distribution

We've created two new datasets for you with essentially the same difference in proportions as the original discrimination data. However, one of the datasets (disc_small) is one third the size of the original dataset and the other (disc_big) is 10 times larger than the original dataset.

Additionally, the same permutation code used previously has been run on the small and big datasets to create small and big distributions of permuted differences in promotion rates (disc_small_perm and disc_big_perm, respectively).

In this exercise, you'll use these two new distributions to get a sense for how the differences vary given widely different sample sizes. In particular, notice the range of variability on the x-axis of each plot.

This exercise is part of the course

Foundations of Inference in R

View Course

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Tabulate the small dataset
disc_small %>% 
  # Select sex and promote
  ___(___, ___)
  
# Do the same for disc_big
___
Edit and Run Code