Summarizing digital checkouts
The reporting team needs the total number of digital checkouts by material type for this week's summary. Build the entire pipeline lazily and call .collect() only at the very end so Polars can plan the whole query at once.
Diese Übung ist Teil des Kurses
<Kurs>Scaling and Optimizing Data Pipelines with Polars</Kurs>Übungsanweisungen
- Filter
libraryto rows whereuseis"Digital". - Group the filtered rows by
format. - Trigger execution at the very end of the pipeline.
Interaktive praktische Übung
Versuche dich an dieser Übung, indem du diesen Beispielcode vervollständigst.
result = (
library
# Filter to digital checkouts
.filter(pl.col("use") == "____")
# Group by format
.group_by("____")
.agg(pl.col("checkouts").sum().alias("total"))
.sort("total", descending=True)
# Trigger execution at the end
.____()
)
print(result)