To link or not to link?
Similar to joins, record linkage is the act of linking data from different sources regarding the same entity. But unlike joins, record linkage does not require exact matches between different pairs of data, and instead can find close matches using string similarity. This is why record linkage is effective when there are no common unique keys between the data sources you can rely upon when linking data sources such as a unique identifier.
In this exercise, you will classify each card whether it is a traditional join problem, or a record linkage one.
This exercise is part of the course
Cleaning Data in Python
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
