Implementing Data Denormalization

In most cases, dimension tables should be denormalized. In this exercise, we'll use PySpark to load two dimension tables, products and categories, and denormalize the products table by replacing the categoryID column with the corresponding categoryName from the table categories.

Note: If you get an error about an API rate limit, this usually happens because a previous Fabric task has not completed. You can view active Fabric tasks from the Monitor page (located on the left vertical menu). Canceling old Fabric tasks in the Monitor page will usually resolve the rate limit error.

This exercise is part of the course

Transform and Analyze Data with Microsoft Fabric

View Course

Hands-on interactive exercise

Turn theory into action with one of our interactive exercises

Start Exercise