Setting up a full data integration system for many application contexts, e.g. web and scientific data management, requires significant human effort which prevents it from being really scalable. In this paper, we propose IFD (Integration based on Functional Dependencies), a pay-as-you-go data integration system that allows integrating a given set of data sources, as well as incrementally integrating additional sources. IFD takes advantage of the background knowledge implied within functional dependencies for matching the source schemas. Our system is built on a probabilistic data model that allows capturing the uncertainty in data integration systems. Our performance evaluation results show significant performance gains of our approach in te...
Data integration approaches mostly attempt to resolve semantic uncertainty and conflicts between dat...
Data integration approaches mostly attempt to resolve semantic uncertainty and conflicts between dat...
The integration of both distributed schemas and data repositories is a major challenge in data and k...
Part 1: ConferenceInternational audienceSetting up a full data integration system for many applicati...
Data integration systems are crucial for applications that need to provide a uniform interface to a ...
Data integration systems offer uniform access to a set of autonomous and heterogeneous data sources....
Abstract. Data integration systems offer uniform access to a set of au-tonomous and heterogeneous da...
Abstract Data integration has been an important area of research for several years. In this chapter,...
Lecture Notes in Computer Science, Vol. 9587Data integration typically seeks to provide the illusion...
In data integration efforts such as in portal development, much development time is devoted to entit...
One of the problems in data integration is data overlap: the fact that different data sources have d...
This paper reports our first set of results on managing uncertainty in data integration. We posit th...
Functional dependencies – traditional, approximate and con-ditional are of critical importance in re...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
Besides the scientific paradigms of empiricism, mathematical modelling, and simulation, the method o...
Data integration approaches mostly attempt to resolve semantic uncertainty and conflicts between dat...
Data integration approaches mostly attempt to resolve semantic uncertainty and conflicts between dat...
The integration of both distributed schemas and data repositories is a major challenge in data and k...
Part 1: ConferenceInternational audienceSetting up a full data integration system for many applicati...
Data integration systems are crucial for applications that need to provide a uniform interface to a ...
Data integration systems offer uniform access to a set of autonomous and heterogeneous data sources....
Abstract. Data integration systems offer uniform access to a set of au-tonomous and heterogeneous da...
Abstract Data integration has been an important area of research for several years. In this chapter,...
Lecture Notes in Computer Science, Vol. 9587Data integration typically seeks to provide the illusion...
In data integration efforts such as in portal development, much development time is devoted to entit...
One of the problems in data integration is data overlap: the fact that different data sources have d...
This paper reports our first set of results on managing uncertainty in data integration. We posit th...
Functional dependencies – traditional, approximate and con-ditional are of critical importance in re...
Abstract — One of the problems in data integration is data overlap: the fact that different data sou...
Besides the scientific paradigms of empiricism, mathematical modelling, and simulation, the method o...
Data integration approaches mostly attempt to resolve semantic uncertainty and conflicts between dat...
Data integration approaches mostly attempt to resolve semantic uncertainty and conflicts between dat...
The integration of both distributed schemas and data repositories is a major challenge in data and k...