A first analysis
As a first step in the analysis, you should consider summaries of the data. This can be done using the summary
command.
The variable summaries tell you which variables are categorical and which are numerical. Think about the types of the variables. If you aren't sure or want to take a closer look at the data, make a graph.
We will first start with analyzing the weight gained by mothers throughout the pregnancy: gained.
This exercise is part of the course
Data Analysis and Statistical Inference
Exercise instructions
- Compute a summary of the data set using the
summary()
function. - Consider which variables are categorical and which are numerical.
- Using visualization and summary statistics, describe the distribution of weight gained by mothers during pregnancy (gained). Specifically look for outliers.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# The nc data frame is already loaded into the workspace
# Compute summaries of the data:
# Use visualization and summary statistics to view the data for gained: