Serverless Data Processing with Dataflow - Writing an ETL pipeline using Apache Beam and Dataflow (Python)
This exercise is part of the course
Serverless Data Processing with Dataflow: Develop Pipelines
Exercise instructions
In this lab, you a) build a batch ETL pipeline in Apache Beam, which takes raw data from Google Cloud Storage and writes it to BigQuery b) run the Apache Beam pipeline on Dataflow and c) parameterize the execution of the pipeline.