Calculating means by category
A good way to explore categorical variables is to calculate summary statistics such as the mean for each category. Here, you'll look at grouped means for the house prices in the Taiwan real estate dataset.
taiwan_real_estate is available and dplyr is loaded.
Deze oefening maakt deel uit van de cursus
Introduction to Regression in R
Oefeninstructies
- Group
taiwan_real_estatebyhouse_age_years. - Summarize to calculate the mean
price_twd_msqfor each group, naming the columnmean_by_group. - Assign the result to
summary_statsand look at the numbers.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
summary_stats <- taiwan_real_estate %>%
# Group by house age
___ %>%
# Summarize to calculate the mean house price/area
___(mean_by_group = ___)
# See the result
summary_stats