Data integration systems offer uniform access to a set of autonomous and heterogeneous data sources. An important task in setting up a data integration system is to match the attributes of the source schemas. In this paper, we propose a data integration system which uses the knowledge implied within functional dependencies for matching the source schemas. We build our system on a probabilistic data model to capture the uncertainty arising during the matching process. Our performance validation confirms the importance of functional dependencies and also using a probabilistic data model in improving the quality of schema matching. Our experimental results show significant performance gain compared to the baseline approaches. They also show th...
This work considers a problem of integrating heterogeneous semi--structured data sources with the pu...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
In data integration we transform information from a source into a target schema. A general problem i...
Abstract. Data integration systems offer uniform access to a set of au-tonomous and heterogeneous da...
Data integration systems are crucial for applications that need to provide a uniform interface to a ...
Abstract Data integration has been an important area of research for several years. In this chapter,...
Setting up a full data integration system for many application contexts, e.g. web and scientific dat...
This paper reports our first set of results on managing uncertainty in data integration. We posit th...
Part 1: ConferenceInternational audienceSetting up a full data integration system for many applicati...
Real world applications that deal with information extraction, such as business intelligence softwar...
Probabilistic data integration is a specific kind of data integration where integration problems suc...
Uncertainty is an intrinsic feature of automatic and semiautomatic data integration processes. Alth...
One of the problems in data integration is data overlap: the fact that different data sources have d...
Data integration has been a challenging problem for decades. In an ambient environment, where many a...
This paper proposes a method for the automatic discovery of probabilistic relationships in the envir...
This work considers a problem of integrating heterogeneous semi--structured data sources with the pu...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
In data integration we transform information from a source into a target schema. A general problem i...
Abstract. Data integration systems offer uniform access to a set of au-tonomous and heterogeneous da...
Data integration systems are crucial for applications that need to provide a uniform interface to a ...
Abstract Data integration has been an important area of research for several years. In this chapter,...
Setting up a full data integration system for many application contexts, e.g. web and scientific dat...
This paper reports our first set of results on managing uncertainty in data integration. We posit th...
Part 1: ConferenceInternational audienceSetting up a full data integration system for many applicati...
Real world applications that deal with information extraction, such as business intelligence softwar...
Probabilistic data integration is a specific kind of data integration where integration problems suc...
Uncertainty is an intrinsic feature of automatic and semiautomatic data integration processes. Alth...
One of the problems in data integration is data overlap: the fact that different data sources have d...
Data integration has been a challenging problem for decades. In an ambient environment, where many a...
This paper proposes a method for the automatic discovery of probabilistic relationships in the envir...
This work considers a problem of integrating heterogeneous semi--structured data sources with the pu...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
In data integration we transform information from a source into a target schema. A general problem i...