Column distribution and duplicates
We can use the Column distribution feature to check our columns for the number of unique values as well as number of distinct categories. This can give us a great indication for which columns contain any duplicate values, and which columns might have the wrong number of categories.
More information about working with duplicates can be found in this MSFT Learn article.
Our manager has asked us to check on the Color column in our dataset, there seems to be an error there because someone mistyped some data. He is sure that we only stock 10 different colors of products (including the products which have no color). Use the features in Power Query to help verify and fix the dataset.
If you lost any progress, start by loading the workbook 2_1_column_distribution.pbix from the Exercises folder on the Desktop and open the Power Query editor.
This exercise is part of the course
Data Preparation in Power BI
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
Start Exercise