Descriptive statistics and function masking
Previously, you have seen that the summary()
function computes the mean, median, 1st, and 3rd quartiles plus the min and max. The standard deviation is not included. However, the Hmisc
and psych
packages both have a describe()
function that provide more descriptive statistics including the standard deviation.
To get these descriptive statistics, you will run the describe()
function which has the same name in both packages. To avoid confusion, you should always use the notation packagename::functionname()
to explicitly specify the function from the package you want.
The abaloneKeep
dataset and dplyr
package are already loaded for you.
This exercise is part of the course
R For SAS Users
Exercise instructions
- Load
Hmisc
package - Get descriptive stats for abalone sex, length, diameter, and height using
Hmisc::describe()
function. - Load
psych
package - notice the warnings about thedescribe()
function masked fromHmisc
package (since both packages have adescribe()
function). - Run
describe()
function frompsych
package to get descriptive stats for abalone length, diameter, height. Leave sex out sincepsych::describe()
only works for numeric variables.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Load the Hmisc package
library(___)
# Run describe() from Hmisc for sex, length, diameter, height
abaloneKeep %>%
select(___, ___, ___, ___) %>%
___::___
# Load the psych package
___
# Run describe() from psych for length, diameter, height
___ %>%
___ %>%
___::___