Analyzing data with SQL
Continuing with our online_retail dataset, your manager wants to understand cancellation patterns across different countries. Since SQL is often more readable for aggregations and grouping, you'll query the data using SQL and inspect the execution plan to verify how Spark processes your query.
In this exercise, you'll create a temporary view from a filtered DataFrame, run a SQL aggregation, and examine the query plan.
This exercise is part of the course
Data Transformation with Spark SQL in Databricks
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
Start Exercise