Testing schemas
The team fixed the values in their expected DataFrame, but Polars inferred total_checkouts as 64-bit while the pipeline produces 32-bit. The two schemas are printed for you so you can see the mismatch. Fix the expected dtype, then assert the schemas match.
actual, expected, and assert_schema_equal are preloaded.
Deze oefening maakt deel uit van de cursus
Scaling and Optimizing Data Pipelines with Polars
Oefeninstructies
- Cast
total_checkoutsinexpectedtopl.Int32. - Assert that the schemas of
actualandexpectedare equal.
Interactieve oefening met praktijkervaring
Probeer deze oefening door deze voorbeeldcode aan te vullen.
# Match the dtype the pipeline produces
expected = expected.with_columns(
pl.col("total_checkouts").cast(pl.____)
)
# Schema-level assertion
____(actual.schema, expected.schema)
print("Schema assertion passed.")
print(actual.schema)