Visualizing numeric vs. categorical
If the explanatory variable is categorical, the scatter plot that you used before to visualize the data doesn't make sense. Instead, a good option is to draw a histogram for each category.
The Taiwan real estate dataset has a categorical variable in the form of the age of each house. The ages have been split into 3 groups: 0 to 15 years, 15 to 30 years, and 30 to 45 years.
taiwan_real_estate
is available and ggplot2
is loaded.
This exercise is part of the course
Introduction to Regression in R
Exercise instructions
- Using
taiwan_real_estate
, plot a histogram ofprice_twd_msq
with10
bins. - Facet the plot by
house_age_years
to give 3 panels.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Using taiwan_real_estate, plot price_twd_msq
___ +
# Make it a histogram with 10 bins
___ +
# Facet the plot so each house age group gets its own panel
___