The process of identifying duplicate records without common identifiers within or across systems (matching) and reconciling them (merging).
Added Perspectives
Give careful thought to record-matching methods. The deterministic approach to matching record values uses rules to compare attribute values and identify either an exact match or approximate match after values are standardized. Probabilistic matching, in contrast, estimates the likelihood of a match (i.e., similarity score) based on parameters such as the frequency with which data values appear across many records. Similarity scores above a certain threshold for a given pair of records are deemed to be a match. Probabilistic approaches, when designed and configured appropriately, often more flexibly accommodate higher varieties of data. Enterprises can select deterministic, probabilistic or heuristic approaches, or a mix of them depending on their dataset characteristics and vendor-specific capabilities.
- Kevin Petrie in Master Data Management: A Modern Guide for Data Governance Professionals
November 15, 2019 (Blog)
Relevant Content
Sep 19, 2016 - A main goal of Master Data Management has been to make sense of all the data that touches the organization and generate trusted, ready-to-go master...
Related Terms
Unleash The Power Of Your Data
Providing modern, comprehensive data solutions so you can turn data into your most powerful asset and stay ahead of the competition.
Learn how we can help your organization create actionable data strategies and highly tailored solutions.
© Datalere, LLC. All rights reserved