ComeçarComece de graça

Spine plot

The aggregation plot you have drawn in the previous exercise gave you some high-level overview of the missing data. If you are interested in the interaction between specific variables, a spine plot is the way to go. It allows you to study the percentage of missing values in one variable for different values of the other, which is conceptually very similar to the t-tests you have been running in the previous lesson.

In this exercise, you will draw a spine plot to investigate the percentage of missing data in earnings for different categories of sub_race. Is there more missing data on earnings for some specific races of the movie's main character? Let's find out! The VIM package has already been loaded for you.

Este exercício faz parte do curso

Handling Missing Data with Imputations in R

Ver curso

Exercício interativo prático

Experimente este exercício completando este código de exemplo.

# Draw a spine plot to analyse missing values in earnings by sub_race
biopics %>% 
	select(___, ___) %>%
	___()
Editar e executar o código