Tabulating Missingness

The summaries of missingness we just calculated give us the number and percentage of missing observations for the cases and variables.

Another way to summarize missingness is to tabulate how many variables, or how many cases, have 0, 1, 2, 3, or more missing values.

In this exercise, we will tabulate the number of missings in each case and variable using miss_var_table() and miss_case_table(), and then combine these summaries with the group_by() function from dplyr to explore them over a grouping variable in the dataset.
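To make the idea of a tabulation concrete, here is a minimal base-R sketch on the built-in airquality data. It is an illustration only and is not how the naniar functions are implemented; the object names (n_miss_in_var, n_miss_in_case, var_tab, case_tab) are chosen here for the example.

```r
# Base-R sketch of "tabulating missingness" on the built-in airquality data.
# Illustration only; this is not naniar's implementation.

# Number of missing values in each variable (column)
n_miss_in_var <- colSums(is.na(airquality))

# Number of missing values in each case (row)
n_miss_in_case <- rowSums(is.na(airquality))

# Tabulate: how many variables have a given number of missings?
var_tab <- table(n_miss_in_var)

# Tabulate: how many cases have a given number of missings?
case_tab <- table(n_miss_in_case)

print(var_tab)
print(case_tab)

# The same case counts expressed as percentages of all cases
print(round(100 * prop.table(case_tab), 1))
```

Each table answers a question of the form "how many variables (or cases) have exactly k missing values?", which is the kind of summary the exercise asks for.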

This exercise is part of the course

Dealing With Missing Data in R

Instructions

For the airquality dataset:

  • Tabulate missingness for each variable using miss_var_table().
  • Tabulate missingness for every case using miss_case_table().
  • Combine previous tabulations with dplyr's group_by() function to create tabulations for each variable and case, by each Month.
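The grouped step can be pictured the same way: split the data by Month, then tabulate within each piece. The following is a base-R sketch under that assumption, using hypothetical names (by_month, case_tabs, var_miss); it does not use naniar or dplyr, and its output format differs from theirs.

```r
# Base-R sketch of tabulating missingness separately for each Month.
# Illustration only; the names below are made up for this example.

# One data frame per month
by_month <- split(airquality, airquality$Month)

# For each month, tabulate how many cases have a given number of missings
case_tabs <- lapply(by_month, function(d) table(rowSums(is.na(d))))

# For each month, count the missings in every variable
# (rows are variables, columns are months)
var_miss <- sapply(by_month, function(d) colSums(is.na(d)))

print(case_tabs)
print(var_miss)
```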

Hands-on interactive exercise

Try this exercise by completing this sample code.

# Tabulate missingness in each variable and case of the `airquality` dataset
___(airquality)
___(___)

# Tabulate the missingness in each variable, grouped by Month, in the `airquality` dataset
airquality %>% group_by(___) %>% miss_var_table()

# Tabulate the missingness in each case, grouped by Month, in the `airquality` dataset
airquality %>% ___ %>% miss_case_table()