1. Learn
  2. /
  3. Courses
  4. /
  5. Dealing With Missing Data in R

Exercise

Impute data below range with nabular data

We want to keep track of values we imputed. If we don't, it is very difficult to assess how good the imputed values are.

We are going to practice imputing data and recreate visualizations in the previous set of exercises by imputing values below the range of the data.

This is a very useful way to help further explore missingness, and also provides the framework for imputing missing values.

First, we are going to impute the data below the range using impute_below_all(), and then visualize the data. We notice that although we can see where the missing values are in this instance, we need some way to track them. The track missing data programming pattern can help with this.

Instructions

100 XP

Using the oceanbuoys data:

  • Impute below the range using impute_below_all().
  • Visualize the new missing values for wind_ew on the x-axis and air_temp_c on the y-axis.
  • Impute and track data with bind_shadow(), impute_below_all(), and add_label_shadow().
  • Show the plot and inspect the imputed values.