Serverless Data Processing with Dataflow - Writing an ETL pipeline using Apache Beam and Dataflow (Python)
Deze oefening maakt deel uit van de cursus
Serverless Data Processing with Dataflow: Develop Pipelines
Oefeninstructies
In this lab, you a) build a batch ETL pipeline in Apache Beam, which takes raw data from Google Cloud Storage and writes it to BigQuery b) run the Apache Beam pipeline on Dataflow and c) parameterize the execution of the pipeline.