NYC SAT Scores EDA

Math is a subject in which U.S. students consistently lag behind those in many other countries, so our experiments will focus on math scores. The original dataset is an open dataset downloaded from Kaggle; throughout this chapter I will add a few variables that let you play the role of an education researcher running experiments aimed at raising students' scores and, ideally, improving their chances of being admitted to college.

Before analyzing the experiments, we should do some exploratory data analysis (EDA) to make sure we fully understand the nyc_scores data. In this lesson, we'll run experiments that block by Borough and Teacher_Education_Level, so let's examine math scores by those variables. The nyc_scores dataset has been loaded for you.
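To make the idea of "examining scores by a blocking variable" concrete, here is a minimal sketch on a small made-up data frame rather than the real nyc_scores data. The data frame `toy_scores`, its values, and the column name `Math_Score` are assumptions for illustration only; the actual column names in nyc_scores may differ. It assumes the dplyr package is installed.

```r
library(dplyr)

# Hypothetical toy data: NOT the real nyc_scores dataset.
# Math_Score is an illustrative column name; one value is missing on purpose.
toy_scores <- data.frame(
  Borough = c("Bronx", "Bronx", "Bronx", "Queens", "Queens", "Queens"),
  Math_Score = c(400, 420, NA, 480, 500, 520)
)

# Summarize the score within each level of the grouping variable.
# na.rm = TRUE drops missing scores so each statistic is computed
# from the observed values instead of returning NA.
toy_summary <- toy_scores %>%
  group_by(Borough) %>%
  summarize(
    mean = mean(Math_Score, na.rm = TRUE),
    var = var(Math_Score, na.rm = TRUE),
    median = median(Math_Score, na.rm = TRUE)
  )

print(toy_summary)
```

Without `na.rm = TRUE`, the Bronx row in this toy example would come back as NA for all three statistics, which is why the argument appears in every summary call.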

This exercise is part of the course

Experimental Design in R

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Mean, var, and median of Math score
nyc_scores %>%
    ___(___) %>% 
    ___(mean = ___(___, na.rm = TRUE),
        var = ___(___, na.rm = TRUE),
        median = ___(___, na.rm = TRUE))