The World Wide Web evolves into a Web of Data, a huge, globally distributed dataspace that contains a rich body of machine-processable information from a virtually unbound set of providers covering a wide range of topics. However, due to the openness of the Web little is known about who created the data and how. The fact that a large amount of the data on the Web is derived by replication, query processing, modification, or merging raises concerns of information quality. Poor quality data may propagate quickly and contaminate the Web of Data. Provenance information about who created and published the data and how, provides the means for quality assessment. This paper takes a first step towards creating a quality-aware Web of Data: we presen...
Abstract. Many curated databases are constructed by scientists inte-grating various existing data so...
Many curated databases are constructed by scientists integrating various existing data sources. Most...
The Web is now being used as a platform for publishing and linking life science data. The Web's link...
The open world of the (Semantic) Web is a global information space offering diverse materials of dis...
Abstract. In this paper, we look at Web data that comes from multiple sources, as in the Web 2.0. We...
This article presents a general overview of digital metadata and provenance data definitions. Ideas ...
Data management is growing in complexity as large-scale applications take advantage of the loosely c...
This chapter outlines some of the challenges and opportunities associated with adopting provenance p...
From where did this tweet originate? Was this quote from the New York Times modified? Daily, we rely...
From where did this tweet originate? Was this quote from the New York Times modified? Daily, we rely...
In many application areas like e-science and data-warehousing detailed information about the origin ...
The ease with which one can copy and transform data on the Web, has made it increasingly difficult t...
Provenance is a record that describes the people, institutions, entities, and activities, involved i...
Data quality assessment is a key factor in data-intensive domains. The data deluge is aggravated by ...
Data management is growing in complexity as largescale applications take advantage of the loosely co...
Abstract. Many curated databases are constructed by scientists inte-grating various existing data so...
Many curated databases are constructed by scientists integrating various existing data sources. Most...
The Web is now being used as a platform for publishing and linking life science data. The Web's link...
The open world of the (Semantic) Web is a global information space offering diverse materials of dis...
Abstract. In this paper, we look at Web data that comes from multiple sources, as in the Web 2.0. We...
This article presents a general overview of digital metadata and provenance data definitions. Ideas ...
Data management is growing in complexity as large-scale applications take advantage of the loosely c...
This chapter outlines some of the challenges and opportunities associated with adopting provenance p...
From where did this tweet originate? Was this quote from the New York Times modified? Daily, we rely...
From where did this tweet originate? Was this quote from the New York Times modified? Daily, we rely...
In many application areas like e-science and data-warehousing detailed information about the origin ...
The ease with which one can copy and transform data on the Web, has made it increasingly difficult t...
Provenance is a record that describes the people, institutions, entities, and activities, involved i...
Data quality assessment is a key factor in data-intensive domains. The data deluge is aggravated by ...
Data management is growing in complexity as largescale applications take advantage of the loosely co...
Abstract. Many curated databases are constructed by scientists inte-grating various existing data so...
Many curated databases are constructed by scientists integrating various existing data sources. Most...
The Web is now being used as a platform for publishing and linking life science data. The Web's link...