Merging using PySpark

In this exercise, we'll upload three files into a lakehouse. Together they represent a snowflaked video game dimension: video games in the first file, genres in the second, and publishers in the third. The goal is to merge them into a single table, using PySpark to perform the joins.

Note: If you get an error about an API rate limit, the usual cause is a previous Fabric task that has not completed. You can view active Fabric tasks on the Monitor page (in the left vertical menu). Canceling old tasks there will usually resolve the rate limit error.

This exercise is part of the course

Transform and Analyze Data with Microsoft Fabric
