Aan de slagGa gratis aan de slag

Replicating samples

When you calculate a point estimate such as a sample mean, the value you calculate depends on the rows that were included in the sample. That means that there is some randomness in the answer. In order to quantify the variation caused by this randomness, you can create many samples and calculate the sample mean (or other statistic) for each sample.

attrition_pop is available; dplyr and ggplot2 are loaded.

Deze oefening maakt deel uit van de cursus

Sampling in R

Cursus bekijken

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Replicate this code 500 times

  attrition_pop %>% 
    slice_sample(n = 20) %>% 
    summarize(mean_attrition = mean(Attrition == "Yes")) %>% 
    pull(mean_attrition)


# See the result
head(mean_attritions)
Code bewerken en uitvoeren