Ignoring parse errors
A bad value, the literal text "unknown", has slipped into the checkouts column of the vendor export. Polars normally fails when it can't parse a value into the inferred dtype. Tell Polars to skip these errors so the team can still load the rest of the data.
A normal row and the row with the bad value are printed for you so you can see what's going on.
Este exercicio faz parte do curso
Scaling and Optimizing Data Pipelines with Polars
Instruções do exercicio
- Add the argument that tells Polars to set bad values to
nulland continue scanning.
exercicio interativo prático
Tente este exercicio completando este código de exemplo.
result = pl.scan_csv(
MESSY_CSV_PATH,
separator=";",
skip_rows=2,
schema_overrides={
"checkouts": pl.Int64,
"branch_code": pl.String,
},
# Tolerate values that don't fit the schema
____=____,
).collect()
print(result)