NYC SAT Scores EDA
Math is a subject in which U.S. students have consistently trailed many other countries, so our experiments will focus on Math scores. The original dataset is an open dataset downloaded from Kaggle; throughout this chapter, I will add a few variables that let you pretend you are an education researcher conducting experiments aimed at raising students' scores and, hopefully, increasing the likelihood that they will be admitted to college.
Before diving into analyzing the experiments, we should do some exploratory data analysis (EDA) to make sure we fully understand the nyc_scores data. In this lesson, we'll do experiments where we block by Borough and Teacher_Education_Level, so let's examine math scores by those variables. The nyc_scores dataset has been loaded for you.
This exercise is part of the course Experimental Design in R.
Have a go at this exercise by completing this sample code.
# Mean, var, and median of Math score
nyc_scores %>%
___(___) %>%
___(mean = ___(___, na.rm = TRUE),
var = ___(___, na.rm = TRUE),
median = ___(___, na.rm = TRUE))
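To make the shape of a grouped summary concrete, here is a minimal sketch on a small made-up data frame rather than the real nyc_scores data. The data frame name nyc_scores_toy, its values, the education-level labels, and the math column name Average_Score_SAT_Math are all assumptions for illustration; check the actual column names with names(nyc_scores) before adapting it. It is one possible way to compute the mean, variance, and median of a score within each Borough using dplyr, not necessarily the exercise's intended solution.

```r
library(dplyr)

# Hypothetical toy data standing in for nyc_scores (values are invented).
# The column name Average_Score_SAT_Math is an assumption, not taken from the exercise text.
nyc_scores_toy <- data.frame(
  Borough = c("Bronx", "Bronx", "Bronx", "Brooklyn", "Brooklyn", "Brooklyn", "Queens", "Queens"),
  Teacher_Education_Level = c("BA", "MA", "PhD", "BA", "MA", "PhD", "BA", "MA"),
  Average_Score_SAT_Math = c(400, 420, 440, 450, NA, 470, 500, 520)
)

# Group by the blocking variable, then compute one summary row per group.
# na.rm = TRUE drops missing scores before each statistic is calculated.
by_borough <- nyc_scores_toy %>%
  group_by(Borough) %>%
  summarize(mean = mean(Average_Score_SAT_Math, na.rm = TRUE),
            var = var(Average_Score_SAT_Math, na.rm = TRUE),
            median = median(Average_Score_SAT_Math, na.rm = TRUE))

print(by_borough)
```

Swapping Borough for Teacher_Education_Level in group_by() would give the same three statistics per education level. Without na.rm = TRUE, any group containing a missing score would return NA for all three statistics.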