Exercise 5. group_by

Now let's practice using the group_by function.

What we are about to do is a very common operation in data science: you will split a data table into groups and then compute summary statistics for each group.

We will compute the average and standard deviation of systolic blood pressure for females for each age group separately. Remember that the age groups are contained in AgeDecade.

Use the functions filter, group_by, summarize, and the pipe %>% to compute the average and standard deviation of systolic blood pressure for females for each age group separately.
Within summarize, save the average and standard deviation of systolic blood pressure (BPSysAve) as average and standard_deviation.
Note: ignore warnings about implicit NAs. This warning will not prevent your code from running or being graded correctly.

Data Types

Quantiles, Percentiles, and Boxplots

Distributions

Normal Distributions

Robust Summaries with Outliers

Introduction to ggplot2

Summarizing with dplyr

Exploring the gapminder dataset

Data Visualization Principles - Part 1

Data Visualization Principles - Part 2

Data Visualization Principles - Part 3

Exercice

Exercise 5. group_by

Instructions