We illustrate how combining retrospective and prospectiveprovenance can yield scientifically meaningful hybrid provenancerepresentations of the computational histories of data produced during a script run. We use scripts from multiple disciplines (astrophysics, climate science, biodiversity data curation, and social network analysis), implemented in Python, R, and MATLAB, to highlight the usefulness of diverse forms of retrospectiveprovenance when coupled with prospectiveprovenance. Users provide prospective provenance, i.e., the conceptual workflows latent in scripts, via simple YesWorkflow annotations, embedded as script comments. Runtime observables can be linked to prospective provenance via relational views and queries. These observabl...
As scientific workflows, and the data they operate on, grow in size and complexity, the task of defi...
Capturing provenance about artifacts produced by distributed scientific processes is a challenging t...
Data provenance is the history of a digital artifact, from the point of collection to its present<br...
The YesWorkflow McPhillips et al. 2015b, McPhillips et al. 2015a toolkit was designed to annotate da...
Abstract. We propose noWorkflow, a tool that transparently captures provenance of scripts and enable...
Integrated provenance support promises to be a chief advantage of scientific workflow systems over s...
Scientists require provenance information either to validate their model or to investigate the origi...
Within computer science, the term provenance has multiple meanings, due to different motivations, pe...
Journal ArticleThe automated tracking and storage of provenance information promises to be a major a...
We present a technique to capture retrospective provenance across a number of tools in a statistical...
Current scientific applications are often structured as workflows and rely on workflow systems to co...
Most workflow systems that support data provenance primarily focus on tracing lineage of data. Data ...
Scientific experiments are becoming increasingly large and complex, with a commensurate increase in ...
dissertationServing as a record of what happened during a scientific process, often computational, p...
Scientists can facilitate data intensive applications to study and understand the behavior of a comp...
As scientific workflows, and the data they operate on, grow in size and complexity, the task of defi...
Capturing provenance about artifacts produced by distributed scientific processes is a challenging t...
Data provenance is the history of a digital artifact, from the point of collection to its present<br...
The YesWorkflow McPhillips et al. 2015b, McPhillips et al. 2015a toolkit was designed to annotate da...
Abstract. We propose noWorkflow, a tool that transparently captures provenance of scripts and enable...
Integrated provenance support promises to be a chief advantage of scientific workflow systems over s...
Scientists require provenance information either to validate their model or to investigate the origi...
Within computer science, the term provenance has multiple meanings, due to different motivations, pe...
Journal ArticleThe automated tracking and storage of provenance information promises to be a major a...
We present a technique to capture retrospective provenance across a number of tools in a statistical...
Current scientific applications are often structured as workflows and rely on workflow systems to co...
Most workflow systems that support data provenance primarily focus on tracing lineage of data. Data ...
Scientific experiments are becoming increasingly large and complex, with a commensurate increase in ...
dissertationServing as a record of what happened during a scientific process, often computational, p...
Scientists can facilitate data intensive applications to study and understand the behavior of a comp...
As scientific workflows, and the data they operate on, grow in size and complexity, the task of defi...
Capturing provenance about artifacts produced by distributed scientific processes is a challenging t...
Data provenance is the history of a digital artifact, from the point of collection to its present<br...