Many scientists are using workflows to systematically design and run computational experiments. Once the workflow is executed, the scientist may want to publish the dataset generated as a result, to be, e.g., reused by other scientists as input to their experiments. In doing so, the scientist needs to curate such dataset by specifying metadata information that describes it, e.g. its derivation history, origins and ownership. To assist the scientist in this task, we explore in this paper the use of provenance traces collected by workflow management systems when enacting workflows. Specifically, we identify the shortcomings of such raw provenance traces in supporting the data publishing task, and propose an approach whereby distilled, yet mor...
Data management is growing in complexity as large-scale applications take advantage of the loosely c...
Journal ArticleThe automated tracking and storage of provenance information promises to be a major a...
Scientists routinely analyse and share data for others to use. Successful data (re)use relies on hav...
The automated tracking and storage of provenance information promises to be a major advantage of sci...
Provenance traces captured by scientific workflows can be useful for designing, debugging and mainte...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific experiments are becoming increasingly large and complex, with a commensurate increase in ...
Integrated provenance support promises to be a chief advantage of scientific workflow systems over s...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific collaboration increasingly involves data sharing between separate groups. We consider a s...
Scientific collaboration increasingly involves data sharing between separate groups. We consider a s...
Scientists routinely analyse and share data for others to use. Successful data (re)use relies on hav...
Scientific collaboration increasingly involves data sharing between separate groups. We consider a s...
Data management is growing in complexity as large-scale applications take advantage of the loosely c...
Journal ArticleThe automated tracking and storage of provenance information promises to be a major a...
Scientists routinely analyse and share data for others to use. Successful data (re)use relies on hav...
The automated tracking and storage of provenance information promises to be a major advantage of sci...
Provenance traces captured by scientific workflows can be useful for designing, debugging and mainte...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific experiments are becoming increasingly large and complex, with a commensurate increase in ...
Integrated provenance support promises to be a chief advantage of scientific workflow systems over s...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific workflows have recently emerged as a new paradigm for representing and managing complex d...
Scientific collaboration increasingly involves data sharing between separate groups. We consider a s...
Scientific collaboration increasingly involves data sharing between separate groups. We consider a s...
Scientists routinely analyse and share data for others to use. Successful data (re)use relies on hav...
Scientific collaboration increasingly involves data sharing between separate groups. We consider a s...
Data management is growing in complexity as large-scale applications take advantage of the loosely c...
Journal ArticleThe automated tracking and storage of provenance information promises to be a major a...
Scientists routinely analyse and share data for others to use. Successful data (re)use relies on hav...