WAIM 2013International audienceMany data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical to resolve conflicts and discover values that reflect the real world; this task is called {\em data fusion}. This paper describes a novel approach that finds true values from conflicting information when there are a large number of sources, among which some may copy from others. We present a case study on real-world data showing that the described algorithm can sign...
Integrated information systems provide users and applications with a unified view of heterogeneous d...
Heterogeneous and dirty data is abundant. It is stored under different, often opaque schemata, it re...
The amount of useful information available on the Web has been growing at a dramatic pace in recent ...
WAIM 2013International audienceMany data management applications, such as setting up Web portals, ma...
Abstract. Many data management applications, such as setting up Web portals, managing enterprise dat...
The abundance of data available on the Web makes more and more probable the case of finding that dif...
Data fusion, within the data integration pipeline, addresses the problem of discovering the true val...
Data fusion is a major task in data management. Frequently, different sources store data about the s...
While the volume and variety of data furnished by disparate data sources has rocketed over the years...
A fundamental problem in data fusion is to determine the veracity of multi-source data in order to r...
In many domains, data cleaning is hampered by our limited ability to specify a comprehensive set of ...
Multiple descriptions about the same entity from different sources will inevitably result in data or...
A fundamental task in data integration is data fusion, the process of fusing multiple recordsreprese...
In many applications, one can obtain descriptions about the same objects or events from a variety of...
International audienceThis paper describes a new approach of heterogeneous data source fusion. Data ...
Integrated information systems provide users and applications with a unified view of heterogeneous d...
Heterogeneous and dirty data is abundant. It is stored under different, often opaque schemata, it re...
The amount of useful information available on the Web has been growing at a dramatic pace in recent ...
WAIM 2013International audienceMany data management applications, such as setting up Web portals, ma...
Abstract. Many data management applications, such as setting up Web portals, managing enterprise dat...
The abundance of data available on the Web makes more and more probable the case of finding that dif...
Data fusion, within the data integration pipeline, addresses the problem of discovering the true val...
Data fusion is a major task in data management. Frequently, different sources store data about the s...
While the volume and variety of data furnished by disparate data sources has rocketed over the years...
A fundamental problem in data fusion is to determine the veracity of multi-source data in order to r...
In many domains, data cleaning is hampered by our limited ability to specify a comprehensive set of ...
Multiple descriptions about the same entity from different sources will inevitably result in data or...
A fundamental task in data integration is data fusion, the process of fusing multiple recordsreprese...
In many applications, one can obtain descriptions about the same objects or events from a variety of...
International audienceThis paper describes a new approach of heterogeneous data source fusion. Data ...
Integrated information systems provide users and applications with a unified view of heterogeneous d...
Heterogeneous and dirty data is abundant. It is stored under different, often opaque schemata, it re...
The amount of useful information available on the Web has been growing at a dramatic pace in recent ...