Automatic data editing consists of three components: identification of erroneous records, identification of the most likely erroneous fields within an erroneous record (the fields to impute), and assignment of acceptable values to failing records. Moreover, the types of data considered naturally fall into three categories: coded (categorical) data, continuous data, and mixed data (both coded and continuous). For coded data, a natural approach to automatic data editing is commonly referred to as the Boolean approach, first developed by Fellegi and Holt. For the fields-to-impute problem, central to the operation of the Fellegi-Holt approach is the explicit recognition of certain implied edits; Fellegi and Holt originally required a complete set...
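The fields-to-impute problem for coded data can be illustrated with a minimal sketch. It assumes that each edit is a combination of coded values that may not occur together, and it searches by brute force for a smallest set of fields whose values can be changed so that the record passes every edit. This is not the Fellegi-Holt procedure itself, which works with implied edits rather than exhaustive search; the field names, domains, and edits below are hypothetical and chosen only for illustration.

```python
from itertools import combinations, product

# Hypothetical coded fields and their admissible values (illustration only).
DOMAINS = {
    "age": ["child", "adult"],
    "marital": ["single", "married"],
    "employment": ["none", "employed"],
}

# Hypothetical edits: a record FAILS an edit when every listed field
# takes one of the listed values at the same time.
EDITS = [
    {"age": {"child"}, "marital": {"married"}},
    {"age": {"child"}, "employment": {"employed"}},
]


def fails(record, edit):
    """True if the record falls inside the forbidden combination."""
    return all(record[field] in values for field, values in edit.items())


def failed_edits(record):
    """Component 1: list the edits that the record fails."""
    return [edit for edit in EDITS if fails(record, edit)]


def fields_to_impute(record):
    """Components 2 and 3, by brute force.

    Tries field subsets in order of increasing size and returns the first
    subset (with one set of replacement values) for which the modified
    record passes all edits. Returns None if no such subset exists.
    """
    fields = list(DOMAINS)
    for size in range(len(fields) + 1):
        for subset in combinations(fields, size):
            for values in product(*(DOMAINS[f] for f in subset)):
                candidate = {**record, **dict(zip(subset, values))}
                if not failed_edits(candidate):
                    return set(subset), candidate
    return None


record = {"age": "child", "marital": "married", "employment": "employed"}
changed, corrected = fields_to_impute(record)
print(len(failed_edits(record)), changed, corrected)
```

In this example the record fails both edits, and changing the single field `age` is enough to pass them, whereas changing `marital` or `employment` alone would not be. Exhaustive search grows exponentially with the number of fields, so it is suitable only for small illustrations.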
Item-nonresponse is often treated by means of an imputation technique. In some cases, the data have ...
Data cleaning is an action which includes a process of correcting and identifying the inconsistencie...
The paper is concerned with the problem of automatic detection and correction of inconsistent or out...
Data editing is the process by which data that are collected in some way (a statistical survey for e...
Data collected by statistical offices generally contain errors, which have to be corrected before re...
The Fellegi-Holt method automatically "corrects" data that fail some predefined requirements. Comput...
Statistical Data Editing (SDE) is the process of checking and correcting data for errors. Winkler (1...
In a variety of relevant real world problems, tasks of "data mining" and "knowledge discovery" are r...
This paper is about editing Boolean classifiers, i.e., determining how a Boole...
Many statistical organizations collect data that are expected to satisfy linear constraints;...
In categorical data, it is typically the case that some combinations of variables are theoretically...
This paper presents some theoretical findings from our recent methodological research addressing the...