Deduping
Now that you have all the raw data in one file, you can continue preparing the dataset for its final use. The IT department sent you completely raw data, so a good next step is to search and remove all duplicate rows that could distort any final analysis.
The Customers
and Orders
sheets will act like reference tables and help provide more details on the orders placed. Both sheets contain identification numbers that should be unique. Let's make sure that is the case!
If you lost progress, close any open reports and load 1_2_deduping.xlsx
from the Workbooks folder. If a message appears saying "Security Warning - External Data Connections have been disabled", click on Enable Content and continue.
This exercise is part of the course
Data Preparation in Excel
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
