This paper provides a mechanism for automatically estimating record linkage false match rates in situations where the subset of true matches is reasonably well separated from other pairs and there is no training data. The method is an alternative to that of Belin and Rubin (JASA 1995) and is applicable in more situations. We provide examples demonstrating why the general problem of error rate estimation (both false match and false nonmatch rates) is likely impossible in situations without training data and exceptionally difficult even in the extremely rare situations when training data are available.
Data linkage is increasingly being used to combine data from different sources with the aim of ident...
Record linkage aims at quickly and accurately identifying if two records represent the same real wor...
Record linkage addresses the problem of identifying pairs of records coming from different sources a...
Cleansing data from synonyms and homonyms is a relevant task in fields where high quality of...
Record linkage is the process of identifying and linking records about the same entities from one or...
Probabilistic record linkage can be an effective research technique even if available records lack s...
Record linkage involves a number of different linking methods to link records from one or more data ...
This chapter explains different proposals that adjust the capture-recapture estimation by explicitly...
Probabilistic record linkage techniques assign match weights to one or more potential matches for th...
The field of Record Linkage is concerned with identifying records from one or more datasets which re...
Linking records from two or more databases is an increasingly important data preparation step in man...
Fellegi and Sunter (1969) developed an algorithm for linking records. Each pair of reco...
Linking or matching databases is becoming increasingly important in many data mining projects, as li...
Probabilistic record linkage allows the assembling of information from different data sources. We pr...