Reading the optimized plan
The naive plan you just printed is the starting point. Before running the query, Polars rewrites it: pushing filters down toward the data source (predicate pushdown), dropping columns the query doesn't need (projection pushdown), and turning sort+head into a fast top-k. Use .explain() to see the optimized plan.
polars is loaded as pl, and the LazyFrame top_titles_by_format is the same query as before.
This exercise is part of the course
Scaling and Optimizing Data Pipelines with Polars
Exercise instructions
- Show the optimized plan for top_titles_by_format.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Show the optimized plan
print(top_titles_by_format.____())