Data pipeline steps
As the lead Data Engineer at Sierra Publishing, you and your team have been asked to clean book review data and turn it into an analytics-ready dataset for your analysts and data scientists.
Since you get data daily from your various publishing partners, you must create an automated and reliable data pipeline for your downstream data consumers.
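To make the scenario concrete, here is a minimal sketch of what one cleaning stage of such a pipeline might look like in plain Python. It is illustrative only: the field names (`book_id`, `rating`, `review_text`), the 1 to 5 rating scale, and the helper names `clean_review` and `run_pipeline` are assumptions, not part of the exercise, and a real Databricks pipeline would more likely use Spark DataFrames.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class CleanReview:
    # Hypothetical schema for an analytics-ready review record.
    book_id: str
    rating: int
    review_text: str


def clean_review(raw: dict) -> Optional[CleanReview]:
    """Return a cleaned review, or None if the raw record is unusable."""
    book_id = str(raw.get("book_id") or "").strip()
    text = str(raw.get("review_text") or "").strip()
    try:
        rating = int(raw.get("rating"))
    except (TypeError, ValueError):
        return None  # missing or non-numeric rating
    # Assumed rule: ratings must fall on a 1 to 5 scale.
    if not book_id or not 1 <= rating <= 5:
        return None
    return CleanReview(book_id, rating, text)


def run_pipeline(raw_records: Iterable[dict]) -> List[CleanReview]:
    """Clean each raw record, dropping invalid rows and exact duplicates."""
    seen = set()
    cleaned = []
    for raw in raw_records:
        review = clean_review(raw)
        if review is None or review in seen:
            continue
        seen.add(review)
        cleaned.append(review)
    return cleaned
```

Keeping the per-record rule in `clean_review` separate from the batch loop in `run_pipeline` means the same validation can be rerun unchanged on each daily delivery.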
This exercise is part of the course
Databricks Concepts
