Abstract. Real-life date is often dirty and costs billions of pounds to businesses worldwide each year. This paper presents a promising ap-proach to improving data quality. It effectively detects and fixes inconsis-tencies in real-life data based on conditional dependencies, an extension of database dependencies by enforcing bindings of semantically related data values. It accurately identifies records from unreliable data sources by leveraging relative candidate keys, an extension of keys for relations by supporting similarity and matching operators across relations. In con-trast to traditional dependencies that were developed for improving the quality of schema, the revised constraints are proposed to improve the quality of data. These co...
Integrity constraints, a.k.a. data dependencies, are being widely used for improving the quality of ...
International audienceDriven by the dominance of the relational model, we investigate how the requir...
Matching dependencies (MDs) are recently proposed for various data quality applications such as dete...
Matching dependencies (MDS) have recently been proposed to make data dependencies tolerant to variou...
The objective of this study has been to answer the question of finding ways to improve the quality o...
To accurately match records it is often necessary to utilize the se-mantics of the data. Functional ...
Recently, there has been a renovated interest in functional dependencies due to the possibility of e...
We present SEMANDAQ, a prototype system for improving the quality of relational data. Based on the r...
Although integrity constraints are the primary means for enforcing data integrity, there are cases i...
Traditionally dependencies are used in database design and data quality control. In knowledge discov...
Relational database schemas must be semantically enriched to reflect knowledge about the data, as ne...
The application and exploitation of large amounts of data play an ever-increasing role in today’s re...
In banks, governments, and Internet companies, inconsistent data problems may often arise when vario...
International audienceMany health care systems and services exploit drug related information stored ...
This paper propose an efficient data cleaning by using extended conditional functional dependencies ...
Integrity constraints, a.k.a. data dependencies, are being widely used for improving the quality of ...
International audienceDriven by the dominance of the relational model, we investigate how the requir...
Matching dependencies (MDs) are recently proposed for various data quality applications such as dete...
Matching dependencies (MDS) have recently been proposed to make data dependencies tolerant to variou...
The objective of this study has been to answer the question of finding ways to improve the quality o...
To accurately match records it is often necessary to utilize the se-mantics of the data. Functional ...
Recently, there has been a renovated interest in functional dependencies due to the possibility of e...
We present SEMANDAQ, a prototype system for improving the quality of relational data. Based on the r...
Although integrity constraints are the primary means for enforcing data integrity, there are cases i...
Traditionally dependencies are used in database design and data quality control. In knowledge discov...
Relational database schemas must be semantically enriched to reflect knowledge about the data, as ne...
The application and exploitation of large amounts of data play an ever-increasing role in today’s re...
In banks, governments, and Internet companies, inconsistent data problems may often arise when vario...
International audienceMany health care systems and services exploit drug related information stored ...
This paper propose an efficient data cleaning by using extended conditional functional dependencies ...
Integrity constraints, a.k.a. data dependencies, are being widely used for improving the quality of ...
International audienceDriven by the dominance of the relational model, we investigate how the requir...
Matching dependencies (MDs) are recently proposed for various data quality applications such as dete...