Calculating means by category
A good way to explore categorical variables is to calculate summary statistics such as the mean for each category. Here, you'll look at grouped means for the house prices in the Taiwan real estate dataset.
taiwan_real_estate is available and dplyr is loaded.
This exercise is part of the course
Introduction to Regression in R
Exercise instructions
- Group taiwan_real_estate by house_age_years.
- Summarize to calculate the mean price_twd_msq for each group, naming the column mean_by_group.
- Assign the result to summary_stats and look at the numbers.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
summary_stats <- taiwan_real_estate %>%
  # Group by house age
  ___ %>%
  # Summarize to calculate the mean house price/area
  ___(mean_by_group = ___)

# See the result
summary_stats
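
To illustrate the group-then-summarize pattern without touching the exercise data, here is a minimal sketch on a small made-up data frame. The names toy_houses, age_band, price, and mean_price are hypothetical and are not part of the course dataset; only group_by() and summarize() from dplyr are the real functions the exercise relies on.

```r
library(dplyr)

# Hypothetical toy data, invented for illustration only
toy_houses <- data.frame(
  age_band = c("0 to 15", "0 to 15", "15 to 30", "15 to 30", "30 to 45"),
  price    = c(12, 14, 9, 11, 10)
)

toy_summary <- toy_houses %>%
  # Form one group per distinct value of age_band
  group_by(age_band) %>%
  # Collapse each group to a single row holding its mean price
  summarize(mean_price = mean(price))

# See the result: one row per age band
toy_summary
```

The result has one row per category, so the number of rows equals the number of distinct values in the grouping column.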