Validating data transformations
Great work so far! Manually spot-checking transformations is a great first step to ensuring that you're maintaining data quality throughout a pipeline. pandas offers several built-in functions to help you with just that!
To help get you started with this exercise, pandas has been imported as pd.
Deze oefening maakt deel uit van de cursus
ETL and ELT in Python
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
def extract(file_path):
# Ingest the data to a DataFrame
raw_data = pd.____(____)
# Return the DataFrame
return raw_data
raw_sales_data = extract("sales_data.parquet")