Ignoring parse errors
A bad value, the literal text "unknown", has slipped into the checkouts column of the vendor export. Polars normally fails when it can't parse a value into the inferred dtype. Tell Polars to skip these errors so the team can still load the rest of the data.
A normal row and the row with the bad value are printed for you so you can see what's going on.
Diese Übung ist Teil des Kurses
<Kurs>Scaling and Optimizing Data Pipelines with Polars</Kurs>Übungsanweisungen
- Add the argument that tells Polars to set bad values to
nulland continue scanning.
Interaktive praktische Übung
Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.
result = pl.scan_csv(
MESSY_CSV_PATH,
separator=";",
skip_rows=2,
schema_overrides={
"checkouts": pl.Int64,
"branch_code": pl.String,
},
# Tolerate values that don't fit the schema
____=____,
).collect()
print(result)