CommencerCommencer gratuitement

NYC SAT Scores EDA

Math is a subject the U.S. is consistently behind the rest of the world on, so our experiments will focus on Math score. While the original dataset is an open dataset downloaded from Kaggle, throughout this chapter I will add a few variables that will allow you to pretend you are an education researcher conducting experiments ideally aimed at raising students' scores, hopefully increasing the likelihood they will be admitted to colleges.

Before diving into analyzing the experiments, we should do some EDA to make sure we fully understand the nyc_scores data. In this lesson, we'll do experiments where we block by Borough and Teacher_Education_Level, so let's examine math scores by those variables. The nyc_scores dataset has been loaded for you.

Cet exercice fait partie du cours

Experimental Design in R

Afficher le cours

Exercice interactif pratique

Essayez cet exercice en complétant cet exemple de code.

# Mean, var, and median of Math score
nyc_scores %>%
    ___(___) %>% 
    ___(mean = ___(___, na.rm = TRUE),
        var = ___(___, na.rm = TRUE),
        median = ___(___, na.rm = TRUE))
Modifier et exécuter le code