Abstract. Data integration systems offer uniform access to a set of au-tonomous and heterogeneous data sources. An important task in setting up a data integration system is to match the attributes of the source schemas. In this paper, we propose a data integration system which uses the knowledge implied within functional dependencies for matching the source schemas. We build our system on a probabilistic data model to capture the uncertainty arising during the matching process. Our perfor-mance validation confirms the importance of functional dependencies and also using a probabilistic data model in improving the quality of schema matching. Our experimental results show significant performance gain compared to the baseline approaches. They ...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
Data integration has been a challenging problem for decades. In an ambient environment, where many a...
This work considers a problem of integrating heterogeneous semi--structured data sources with the pu...
Data integration systems offer uniform access to a set of autonomous and heterogeneous data sources....
International audienceData integration systems off er uniform access to a set of autonomous and hete...
Data integration systems are crucial for applications that need to provide a uniform interface to a ...
Abstract Data integration has been an important area of research for several years. In this chapter,...
Setting up a full data integration system for many application contexts, e.g. web and scientific dat...
This paper reports our first set of results on managing uncertainty in data integration. We posit th...
Part 1: ConferenceInternational audienceSetting up a full data integration system for many applicati...
Real world applications that deal with information extraction, such as business intelligence softwar...
Uncertainty is an intrinsic feature of automatic and semiautomatic data integration processes. Alth...
Probabilistic data integration is a specific kind of data integration where integration problems suc...
This paper proposes a method for the automatic discovery of probabilistic relationships in the envir...
One of the problems in data integration is data overlap: the fact that different data sources have d...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
Data integration has been a challenging problem for decades. In an ambient environment, where many a...
This work considers a problem of integrating heterogeneous semi--structured data sources with the pu...
Data integration systems offer uniform access to a set of autonomous and heterogeneous data sources....
International audienceData integration systems off er uniform access to a set of autonomous and hete...
Data integration systems are crucial for applications that need to provide a uniform interface to a ...
Abstract Data integration has been an important area of research for several years. In this chapter,...
Setting up a full data integration system for many application contexts, e.g. web and scientific dat...
This paper reports our first set of results on managing uncertainty in data integration. We posit th...
Part 1: ConferenceInternational audienceSetting up a full data integration system for many applicati...
Real world applications that deal with information extraction, such as business intelligence softwar...
Uncertainty is an intrinsic feature of automatic and semiautomatic data integration processes. Alth...
Probabilistic data integration is a specific kind of data integration where integration problems suc...
This paper proposes a method for the automatic discovery of probabilistic relationships in the envir...
One of the problems in data integration is data overlap: the fact that different data sources have d...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
Data integration has been a challenging problem for decades. In an ambient environment, where many a...
This work considers a problem of integrating heterogeneous semi--structured data sources with the pu...