Visualizing kNN distance score
The kNN distance score can be hard to interpret by simply eyeballing a set of values. It's helpful to use scatterplots to visualize the kNN distance score to understand how the score works. When interpreting the plot, the relative size of the kNN distance score is more informative than the absolute value.
The wine
data has been loaded with the kNN distance score appended from the previous exercise.
This exercise is part of the course
Introduction to Anomaly Detection in R
Exercise instructions
- Use a scatterplot to show
pH
andalcohol
on their original scales. - Supply an appropriate value to the
cex
argument so that each point is proportional in size to the square root of thescore
. - Adjust the plotting character argument
pch
so that points are shown as solid bullets.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Scatterplot showing pH, alcohol and kNN score
plot(pH ~ alcohol, ___, cex = ___, pch = ___)