Get startedGet started for free

Spine plot

The aggregation plot you have drawn in the previous exercise gave you some high-level overview of the missing data. If you are interested in the interaction between specific variables, a spine plot is the way to go. It allows you to study the percentage of missing values in one variable for different values of the other, which is conceptually very similar to the t-tests you have been running in the previous lesson.

In this exercise, you will draw a spine plot to investigate the percentage of missing data in earnings for different categories of sub_race. Is there more missing data on earnings for some specific races of the movie's main character? Let's find out! The VIM package has already been loaded for you.

This exercise is part of the course

Handling Missing Data with Imputations in R

View Course

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Draw a spine plot to analyse missing values in earnings by sub_race
biopics %>% 
	select(___, ___) %>%
	___()
Edit and Run Code