1. Exploring Databricks SQL
Hello there. In this video, I’ll be acting as a data analyst for a large coffee retail chain. My company recently adopted Databricks for our analytics, and I’m eager to familiarize myself with the platform.
Upon logging into the Databricks UI, I notice several SQL-friendly sections. To run my queries, I’ll create a small SQL Warehouse, selecting a 2X-Small Serverless option for simplicity and performance during development. Next, I’ll open the SQL Editor to view my coding environment, where I see my dataset, *coffee_sales*, in the catalog, along with its column details.
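Before writing queries, it can help to confirm the table's schema from the editor itself. A minimal sketch, assuming the table lives under a default catalog and schema path (the exact path in your workspace may differ):

```sql
-- Inspect the columns and data types of the coffee_sales table.
-- The catalog/schema path (main.default) is an assumption; adjust to
-- wherever coffee_sales appears in your catalog pane.
DESCRIBE TABLE main.default.coffee_sales;
```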
To start, I’ll write a classic `SELECT *` statement to return all records from this table. For convenience, I can click the three arrows next to the table name in the catalog pane to copy the correct syntax. Alternatively, I could write custom SQL queries. In this case, I want to create a bar chart visualizing the number of drinks sold by type to help forecast supplies. I’ll set the X-axis to drink type and use a count of rows as the aggregate on the Y-axis, then hit 'Save' to store my visual.
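The two queries described above can be sketched as follows. The `SELECT *` is what the visualization is built on; the second query shows an equivalent aggregation to the bar chart. Note that `drink_type` is an assumed column name, since the narration only refers to "drink type":

```sql
-- Return all records from the table (the bar chart is configured
-- on top of this result in the visualization editor).
SELECT * FROM coffee_sales;

-- Equivalent aggregation: number of drinks sold per type.
-- The column name drink_type is an assumption; check the catalog
-- pane for the actual column name in your schema.
SELECT drink_type,
       COUNT(*) AS drinks_sold
FROM coffee_sales
GROUP BY drink_type
ORDER BY drinks_sold DESC;
```

Writing the aggregation explicitly, rather than relying on the chart's built-in aggregate, makes the result reusable in other queries or dashboards.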
Next, I’m interested in analyzing revenue by payment mode. I’ll create a donut chart, setting payment mode as the X-axis and summing revenue on the Y-axis. Again, I’ll save this visual.
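The aggregation behind the donut chart can be sketched like this. The column names `payment_mode` and `revenue` are assumptions based on the narration:

```sql
-- Total revenue per payment mode (the data behind the donut chart).
-- payment_mode and revenue are assumed column names; substitute the
-- actual columns shown in the catalog pane.
SELECT payment_mode,
       SUM(revenue) AS total_revenue
FROM coffee_sales
GROUP BY payment_mode;
```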
With two visuals ready, I’ll build a high-level dashboard to display them together. While I could use the Dashboards section in Databricks SQL, I’ll instead connect Power BI to Databricks. By navigating to Partner Connect in the left pane and selecting Power BI under the BI tools section, I can download a connection file to link directly to my SQL Warehouse and access the data.
As we've seen, Databricks SQL makes this kind of ad hoc analysis quick and straightforward, supporting everything from writing queries to building visuals and connecting BI tools, all within the platform.