Analyzing data with SQL
Continuing with our online_retail dataset, your manager wants to understand cancellation patterns across different countries. Since SQL is often more readable for aggregations and grouping, you'll query the data using SQL and inspect the execution plan to verify how Spark processes your query.
In this exercise, you'll create a temporary view from a filtered DataFrame, run a SQL aggregation, and examine the query plan.
Deze oefening maakt deel uit van de cursus
Data Transformation with Spark SQL in Databricks
Praktische interactieve oefening
Zet theorie om in actie met een van onze interactieve oefeningen.
Begin met trainen