Aggregating sales data with SQL
Most production pipelines push computation down to the database where the data already lives. Your first SQL pipeline does exactly that: it aggregates e-commerce orders by date inside DuckDB, then reads the totals back so you can verify the result.
The duckdb_analytics connection, the raw_orders source, and an empty daily_summary table are set up for you.
This exercise is part of the course
Building Data Pipelines with Airflow
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
Start Exercise