Record linkage, or exact matching, can be used to join together two files that contain information on the same individuals, but lack unique personal identification codes. The possibility of errors in linkage causes problems for estimating the relationships between variables on the two files. The effect is analogous to the impact of measurement error. A model of a linear regression relationship between variables in linked files is proposed. Assuming the probabilities that pairs of records are links are known, an unbiased estimator of the regression coefficients is derived. Methods for estimating the linkage probabilities by using mixture models are discussed. A consistent estimator of the covariance matrix of the proposed estimator is propos...
Probabilistic matching of records is widely used to create linked data sets for use in health scienc...
Probabilistic record linkage allows the assembling of information from different data sources. We pr...
With increasing availability of large datasets derived from administrative and other sources, there ...
In this paper we have described and extended some recent proposals on a general Bayesian methodology...
Record linkage brings together information from records in two or more data sources that are believe...
Record linkage methods help us combine multiple data sets from different sources when a single data ...
Data linkage is increasingly being used to combine data from different sources with the aim of ident...
In many healthcare and social science applications, information about units is dispersed across mult...
In this paper we have described and extended some recent proposals on a general Bayesian methodology...
There is growing interest in a data integration approach to survey sampling, particularly where popu...
Most probability-based methods used to link records from two distinct data sets correspond- ing to t...
Record linkage is the act of bringing together records that are believed to belong to the same unit ...
Record linkage involves a number of different linking methods to link records from one or more data ...
Linked data sets are often multi-linked, i.e. they are created by matching records from three or mor...
Data linkage is the act of bringing together records that are believed to belong to the same unit (e...
Probabilistic matching of records is widely used to create linked data sets for use in health scienc...
Probabilistic record linkage allows the assembling of information from different data sources. We pr...
With increasing availability of large datasets derived from administrative and other sources, there ...
In this paper we have described and extended some recent proposals on a general Bayesian methodology...
Record linkage brings together information from records in two or more data sources that are believe...
Record linkage methods help us combine multiple data sets from different sources when a single data ...
Data linkage is increasingly being used to combine data from different sources with the aim of ident...
In many healthcare and social science applications, information about units is dispersed across mult...
In this paper we have described and extended some recent proposals on a general Bayesian methodology...
There is growing interest in a data integration approach to survey sampling, particularly where popu...
Most probability-based methods used to link records from two distinct data sets correspond- ing to t...
Record linkage is the act of bringing together records that are believed to belong to the same unit ...
Record linkage involves a number of different linking methods to link records from one or more data ...
Linked data sets are often multi-linked, i.e. they are created by matching records from three or mor...
Data linkage is the act of bringing together records that are believed to belong to the same unit (e...
Probabilistic matching of records is widely used to create linked data sets for use in health scienc...
Probabilistic record linkage allows the assembling of information from different data sources. We pr...
With increasing availability of large datasets derived from administrative and other sources, there ...