Exercise

Multivariate outlier detection

100 persons living in the same area have filed a claim because their houses were damaged by hail from Sunday night's storm. The dataset hailinsurance contains 100 observation and 2 variables. The first column contains the payments that were done by the insurance company to each customer whereas the second column is the most recent house price.

In this exercise, you're first going to use classical estimators on the dataset. You will then compare the results with those of robust estimators.

Instructions 1/2

undefined XP
    1
    2
  • Compute the sample mean and sample covariance matrix of both variables.
  • Add a 97.5% tolerance ellipsoid using these classical estimators.