Aan de slagGa gratis aan de slag

Filling missing values with pandas

When building data pipelines, it's inevitable that you'll stumble upon missing data. In some cases, you may want to remove these records from the dataset. But in others, you'll need to impute values for the missing information. In this exercise, you'll practice using pandas to impute missing test scores.

Data from the file "testing_scores.json" has been read into a DataFrame, and is stored in the variable raw_testing_scores. In addition to this, pandas has been loaded as pd.

Deze oefening maakt deel uit van de cursus

ETL and ELT in Python

Cursus bekijken

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

# Print the head of the `raw_testing_scores` DataFrame
print(raw_testing_scores.____)
Code bewerken en uitvoeren