Calculating means by category
A good way to explore categorical variables is to calculate summary statistics such as the mean for each category. Here, you'll look at grouped means for the house prices in the Taiwan real estate dataset.
taiwan_real_estate
is available and dplyr
is loaded.
This exercise is part of the course
Introduction to Regression in R
Exercise instructions
- Group
taiwan_real_estate
byhouse_age_years
. - Summarize to calculate the mean
price_twd_msq
for each group, naming the columnmean_by_group
. - Assign the result to
summary_stats
and look at the numbers.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
summary_stats <- taiwan_real_estate %>%
# Group by house age
___ %>%
# Summarize to calculate the mean house price/area
___(mean_by_group = ___)
# See the result
summary_stats