Overplotting 4: Integer data
Let's take a look at the last case of dealing with overplotting:
- Integer data
This can be type integer
(i.e. 1 ,2, 3…) or categorical (i.e. class factor
) variables. factor
is just a special class of type integer
.
You'll typically have a small, defined number of intersections between two variables, which is similar to case 3, but you may miss it if you don't realize that integer and factor data are the same as low precision data.
The Vocab
dataset provided contains the years of education and vocabulary test scores from respondents to US General Social Surveys from 1972-2004.
This exercise is part of the course
Introduction to Data Visualization with ggplot2
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Examine the structure of Vocab
___
# Plot vocabulary vs. education
___ +
# Add a point layer
___