Aan de slagGa gratis aan de slag

Calculating statistic of interest

Using the NHANES dataset, let's investigate the relationship between gender and home ownership. Remember, more information about the dataset can be found here: NHANES.

As seen in the video, natural variability can be modeled from shuffling observations around to remove any relationships that might exist in the population. However, before you permute the data, you need to calculate the original observed statistic. In this exercise, you will calculate the difference in proportion of home owners who are men versus women.

Recall that:

  • %in% returns a logical vector that is TRUE when values on the left hand side are listed on the right hand side.
  • The mean of a logical vector is the proportion of cases where that vector is TRUE.
fruits <- c("apple", "banana", "cherry")
fruits %in% c("banana", "cherry")
mean(fruits %in% c("banana", "cherry"))

Deze oefening maakt deel uit van de cursus

Foundations of Inference in R

Cursus bekijken

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

homes <- NHANES %>%
  # Select Gender and HomeOwn
  ___(___, ___) %>%
  # Filter for HomeOwn equal to "Own" or "Rent"
  ___(___ %in% c("___", "___"))
Code bewerken en uitvoeren