This paper introduces provGen, a generator aimed at producing large synthetic provenance graphs of arbitrary size with predictable properties. Synthetic provenance graphs serve two main purposes. Firstly, they provide a variety of controlled workloads that can be used to test the storage and query capabilities of provenance management systems at scale. Secondly, they provide challenging testbeds for experimenting with graph algorithms for provenance analytics, an area of increasing research interest. provGen produces PROV graphs and stores them in a graph DBMS (Neo4j). A key feature is that users can control the relationship makeup and topological features of the graph by providing a seed provenance pattern along with a set of constraints, e...
Since its inception, the PROV standard has been widely adopted as a standardized exchange format for...
Provenance is a record that describes the people, instituti...
The provenance community has built a number of systems to collect provenance, most of which assume t...
Data provenance is a structured form of metadata designed to record the activities and datasets invo...
Provenance generated by different workflow systems is generally expressed using different formats. T...
In a data-driven world, being able to record from where data was derived, and by whom is key. The w...
PROV is a specification, promoted by the World Wide Web consortium, for recording the provenance of ...
PROV-TEMPLATE is a declarative approach that allows designers and programmers to design and generate...
As more systems become PROV-enabled, there will be a corresponding increase in the need to communi...
As data provenance becomes a significant metadata in validating the origin of information and assert...
Provenance metadata can be valuable in data sharing settings, where it can be used to help data cons...
Data provenance is information about where data come from (provenance data) and how they transform (...