Aan de slagBegin gratis

Summarizing digital checkouts

The reporting team needs the total number of digital checkouts by material type for this week's summary. Build the entire pipeline lazily and call .collect() only at the very end so Polars can plan the whole query at once.

Deze oefening maakt deel uit van de cursus

Scaling and Optimizing Data Pipelines with Polars

Bekijk cursus

Oefeninstructies

  • Filter library to rows where use is "Digital".
  • Group the filtered rows by format.
  • Trigger execution at the very end of the pipeline.

Interactieve oefening met praktijkervaring

Probeer deze oefening door deze voorbeeldcode aan te vullen.

result = (
    library
    # Filter to digital checkouts
    .filter(pl.col("use") == "____")
    # Group by format
    .group_by("____")
    .agg(pl.col("checkouts").sum().alias("total"))
    .sort("total", descending=True)
    # Trigger execution at the end
    .____()
)
print(result)
Code bewerken en uitvoeren