Identifying Duplicate Customers
Identifying Duplicate Customers is my five part series of Data Quality Pro articles providing a vendor-neutral data matching methodology for dealing with one of the most common data quality challenges. Topics covered in the series:
- Why a symbiosis of technology and methodology is necessary when approaching this challenge
- How performing a preliminary analysis on a representative data sample prepares effective examples for discussion
- Why using a detailed, interrogative analysis of those examples is imperative for defining your business rules
- How both false negatives and false positives illustrate the highly subjective nature of this problem
- How to document your business rules for identifying duplicate customers
- How to set realistic expectations about application development
- How to foster a collaboration of the business and technical teams throughout the entire project
- How to consolidate identified duplicates by creating a “best of breed” representative record
Complete Series of Articles
- Identifying Duplicate Customers (Part 1) – Series Introduction
- Identifying Duplicate Customers (Part 2) – False Negatives
- Identifying Duplicate Customers (Part 3) – False Positives
- Identifying Duplicate Customers (Part 4) – Best Practices
- Identifying Duplicate Customers (Part 5) – Duplicate Consolidation
Presentation and Study Guide
- Identifying Duplicate Customers (Presentation) – Adobe Acrobat Document (.pdf file) containing the presentation slides, which also include speaker notes as embedded comments.
- Identifying Duplicate Customers (Study Guide) – Adobe Acrobat Document (.pdf file) for the Data Quality Pro Study Guide.
Related Blog Posts
- OCDQ Radio - The Art of Data Matching — Featuring special guest Henrik Liliendahl Sørensen


