Reading from a database
As a data engineer for Sierra Publishing, you often need to reference datasets from other groups within your organization. In this case, you need to read data relating to recent marketing campaigns, which is stored on a Postgres database in the cloud. Historically you have logged into the database and manually run the queries, but you would like to pull that data into a Databricks notebook and process the data with a Spark DataFrame.
What is your best approach if you only want to read the data directly from the database?
This exercise is part of the course
Databricks Concepts
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
