Implementing Data Aggregation
In this exercise, we'll upload a file into a lakehouse and use PySpark to aggregate its data. The file lists video games and their publishers. We want to produce a summary table with the number of games and the average critic rating per publisher.
Note: An API rate limit error usually means that a previous Fabric task has not completed. You can view active Fabric tasks on the Monitor page, which is in the menu on the left. Canceling old tasks there will usually resolve the error.
This exercise is part of the course Transform and Analyze Data with Microsoft Fabric.
