Get startedGet started for free

Setting up a Databricks workspace example

1. Setting up a Databricks workspace example

Hello there! In this video, I will be showing you an example of how to get started with your Databricks environment, as well as providing an overview of the platform. In this scenario, I will be acting the part of the lead cloud data architect at Sierra Publishing. I have already setup our Databricks workspace so that our teams can start to experiment with the platform and use it for their analytical workloads. Now that I am in the Databricks UI, let us take a moment to explore what is available to us. Looking at the left-hand side of the screen, I can see all of the different components of the platform. On the top we have all of the general capabilitieslike like our compute clusters or data access. Below that, I can see there are a variety of capabilities designed for different data workloads, such as Delta Live Tables of the Model Registry. In later videos, we will be diving deeper into these other capabilities, so I will skip them for now. Since I am both an Account Admin and a Workspace Admin, I can look at a variety of settings available to me. To see setting, I can click on the drop-down under my username in the top-right corner. Starting with the workspace settings by clicking the Admin Settings under my name, I can see I have control over many different aspects of the workspace, such as users, permissions, and specific configurations for our compute resources. Moving on, I can now take a look at the Account Console by clicking on the Manage Account option under my username. Once in the Account Console, I can look through different components of our account, such as all of our deployed workspaces in the workspaces section, and our data in Unity Catalogs under the data section. Let’s go ahead and create a new Unity Catalog where we can host external datasets. In the Data section of the Account Console, I can create a new Catalog. In this view, I have a few key components to setup. First, I need to provide a name for my catalog. Next, I will need to provide some basic information about a storage location for the catalog. This will define a bucket in my data lake where Unity Catalog can store metadata and managed Delta tables within my cloud subscription. Finally, I will have to provide a managed identity for Databricks to access the storage location securely. With all of that setup, I have created a new catalog that I can manage a new set of data. Now that I am familiar with my overall Databricks administration, let’s jump back into the course and see how we can get this setup in your own environment.

2. Let's practice!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.