AbstractCleansing data from synonyms and homonyms is a relevant task in fields where high quality of data is crucial, for example in disease registries and medical research networks. Record linkage provides methods for minimizing synonym and homonym errors thereby improving data quality. We focus our attention to the case of homonym errors (in the following denoted as ‘false matches’), in which records belonging to different entities are wrongly classified as equal. Synonym errors (‘false non-matches’) occur when a single entity maps to multiple records in the linkage result. They are not considered in this study because in our application domain they are not as crucial as false matches. False match rates are frequently computed manually th...
Probabilistic record linkage techniques assign match weights to one or more potential matches for th...
Record linkage is the process of identifying and linking records about the same entities from one or...
Objectives: In the absence of unique ID numbers, cancer and other registries in Germany and elsewhe...
This paper provides a mechanism for automatically estimating record linkage false match rates in sit...
We consider the problem of duplicate detection in noisy and incomplete data: given a large data set ...
AbstractIntroductionExisting record linkage methods do not handle missing linking field values in an...
AbstractProbabilistic record linkage is a method commonly used to determine whether demographic reco...
Several statistical packages, either commercials or open-source, provide many methods for multi-fact...
Data analysis requires data to be of a high quality. Unfortunately this is not always the case, esp...
Linkage of medical databases, including insurer claims and electronic health records (EHRs), is incr...
Duplicate patient records in health information systems have received increased attention in recent ...
Record linkage is the task of identifying which records from different data sources refer to the sam...
Record linkage brings together information from records in two or more data sources that are believe...
Background: Linkage of electronic healthcare records is becoming increasingly important for research...
© 2016 The authors and IOS Press. Objectives: To develop and test an optimal ensemble configuration ...
Probabilistic record linkage techniques assign match weights to one or more potential matches for th...
Record linkage is the process of identifying and linking records about the same entities from one or...
Objectives: In the absence of unique ID numbers, cancer and other registries in Germany and elsewhe...
This paper provides a mechanism for automatically estimating record linkage false match rates in sit...
We consider the problem of duplicate detection in noisy and incomplete data: given a large data set ...
AbstractIntroductionExisting record linkage methods do not handle missing linking field values in an...
AbstractProbabilistic record linkage is a method commonly used to determine whether demographic reco...
Several statistical packages, either commercials or open-source, provide many methods for multi-fact...
Data analysis requires data to be of a high quality. Unfortunately this is not always the case, esp...
Linkage of medical databases, including insurer claims and electronic health records (EHRs), is incr...
Duplicate patient records in health information systems have received increased attention in recent ...
Record linkage is the task of identifying which records from different data sources refer to the sam...
Record linkage brings together information from records in two or more data sources that are believe...
Background: Linkage of electronic healthcare records is becoming increasingly important for research...
© 2016 The authors and IOS Press. Objectives: To develop and test an optimal ensemble configuration ...
Probabilistic record linkage techniques assign match weights to one or more potential matches for th...
Record linkage is the process of identifying and linking records about the same entities from one or...
Objectives: In the absence of unique ID numbers, cancer and other registries in Germany and elsewhe...