LoslegenKostenlos loslegen

Variable-wise imputation errors

In the previous exercise you have extracted the estimated imputation errors from missForest's output. This gave you two numbers:

  • the normalized root mean squared error (NRMSE) for all continuous variables;
  • the proportion of falsely classified entries (PFC) for all categorical variables.

However, it could well be that the imputation model performs great for one continuous variable and poor for another! To diagnose such cases, it is enough to tell missForest to produce variable-wise error estimates. This is done by setting the variablewise argument to TRUE.

The biopics data and missForest package have already been loaded for you, so let's take a closer look at the errors!

Diese Übung ist Teil des Kurses

Handling Missing Data with Imputations in R

Kurs anzeigen

Interaktive Übung

Vervollständige den Beispielcode, um diese Übung erfolgreich abzuschließen.

# Impute biopics data with missForest computing per-variable errors
imp_res <- ___(___, ___ = ___)
Code bearbeiten und ausführen