Exercise 2. Percentiles
Suppose we can't make a plot and want to compare the distributions side by side. If the number of data points is large, listing all the numbers is inpractical. A more practical approach is to look at the percentiles. We can obtain percentiles using the quantile
function like this
library(dslabs)
data(heights)
quantile(heights$height, seq(.01, 0.99, 0.01))
This exercise is part of the course
Data Science Visualization - Module 2
Exercise instructions
- Create two five row vectors showing the 10th, 30th, 50th, 70th, and 90th percentiles for the heights of each sex called these vectors
female_percentiles
andmale_percentiles
. - Then create a data frame called
df
with these two vectors as columns. The column names should befemale
andmale
and should appear in that order. As an example consider that if you want a data frame to have column namesnames
andgrades
, in that order, you do it like this:
df <- data.frame(names = c("Jose", "Mary"), grades = c("B", "A"))
- Take a look at the
df
by printing it. This will provide some information on how male and female heights differ.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
library(dslabs)
data(heights)
male <- heights$height[heights$sex=="Male"]
female <- heights$height[heights$sex=="Female"]