Data Filtering with PySpark

In this exercise, we'll upload a file to a lakehouse and use PySpark to filter its data. The file lists video games and their publishers. We want to reduce the dataset to only the video games published by Atari.

Note: An API rate limit error usually means that a previous Fabric task has not completed. You can view active Fabric tasks on the Monitor page (in the left vertical menu). Canceling old tasks there usually resolves the error.

This exercise is part of the course Transform and Analyze Data with Microsoft Fabric.
