Get startedGet started for free

Analyzing data with SQL

Continuing with our online_retail dataset, your manager wants to understand cancellation patterns across different countries. Since SQL is often more readable for aggregations and grouping, you'll query the data using SQL and inspect the execution plan to verify how Spark processes your query.

In this exercise, you'll create a temporary view from a filtered DataFrame, run a SQL aggregation, and examine the query plan.

This exercise is part of the course

Data Transformation with Spark SQL in Databricks

View Course

Hands-on interactive exercise

Turn theory into action with one of our interactive exercises

Start Exercise