Quiz Question 4

You are building a new pipeline using Serverless for Apache Spark. The source data is highly structured, with well-defined columns like userid and purchaseamount. The primary task involves complex filtering and calculations on these columns. According to modern Spark best practices, which core API should you use to represent and manipulate this data?

This exercise is part of the course

Build Batch Data Pipelines on Google Cloud
