Top "Record-linkage" questions

Record linkage refers to the task of finding records in a data set that refer to the same entity when the entities do not have unique identifiers.
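
As a rough illustration of the definition above, here is a minimal sketch that links records by fuzzy string similarity using only Python's standard library. The sample records, the field names (`name`, `city`), the helper names, and the 0.85 threshold are all assumptions made for this example; they do not come from any of the questions listed here.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical sample records with no shared unique identifier.
records = [
    {"name": "Jon Smith", "city": "New York"},
    {"name": "John Smith", "city": "new york"},
    {"name": "Alice Jones", "city": "Boston"},
]

def record_key(record):
    """Flatten a record into one normalized comparison string."""
    return " ".join(record[field].strip().lower() for field in ("name", "city"))

def similarity(a, b):
    """Similarity ratio in [0, 1] between two records."""
    return SequenceMatcher(None, record_key(a), record_key(b)).ratio()

def link_records(records, threshold=0.85):
    """Return index pairs whose similarity meets the (assumed) threshold."""
    return [
        (i, j)
        for (i, a), (j, b) in combinations(enumerate(records), 2)
        if similarity(a, b) >= threshold
    ]

matches = link_records(records)
print(matches)
```

This compares every pair of records, so the work grows quadratically with the number of records. That is fine for a small sample, but it is the cost that several of the questions below are trying to avoid on large data sets.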

Using machine learning to de-duplicate data

I have the following problem and was thinking I could use machine learning but I'm not completely certain it will …

machine-learning duplicates mahout recommendation-engine record-linkage

Best machine learning approach to automate text/fuzzy matching

I'm reasonably new to machine learning; I've done a few projects in Python. I'm looking for advice on how to …

machine-learning text-classification fuzzy-comparison record-linkage

How to apply machine learning to fuzzy matching

Let's say that I have an MDM system (Master Data Management), whose primary application is to detect and prevent duplication …

algorithm machine-learning fuzzy-comparison record-linkage

Fuzzy matching deduplication in less than exponential time?

I have a large database (potentially in the millions of records) with relatively short strings of text (on the order …

algorithm duplicates time-complexity fuzzy record-linkage

Data Deduplication algorithm for large number of contacts

I'm developing an application which must be able to find & merge duplicates in hundreds of thousands of contact …

algorithm duplicates fuzzy-logic record-linkage