Realistic, relevant, and reproducible experiments often need input traces collected from real-world environments. In this work, we focus on traces of workflows, which are common in datacenters, clouds, and HPC infrastructures. We show that the state of the art in using workflow traces raises important issues: (1) the use of realistic traces is infrequent and (2) the use of realistic, open-access traces even more so. To alleviate these issues, we introduce the Workflow Trace Archive (WTA), an open-access archive of workflow traces from diverse computing infrastructures and tooling to parse, validate, and analyze traces. The WTA includes >48 million workflows captured from >10 computing infrastructures, representing a broad diversity of trace...
One of the foundations of science is that researchers must publish the methodology used to achieve t...
In recent years, a variety of systems have been developed that export the workflows used to analyze ...
Workflows provide a popular means for preserving scientific methods by explicitly encoding their pro...
PROV has been adopted by a number of workflow systems for encoding the traces ...
Many scientists are using workflows to systematically design and run computational experiments. Once...
Workflows have become a popular means for implementing experiments in computational sciences. They a...
SHARP is a Linked Data approach for harmonizing cross-workflow provenance. In ...
Scientific collaboration increasingly involves data sharing between separate groups. We consider a s...
Repeated executions of resource-intensive workflows over a large number of runs are commonly observe...
© 2017 Elsevier B.V. The emergence of Cloud computing provides a new computing paradigm for scientif...
Workflows processing data from research activities and driving in silico experiments are becoming an...
Workflows have been used traditionally as a means to describe and implement the computing usually par...
A significant amount of recent research in scientific workflows aims to develop new techniq...
Workflows may be defined as abstractions used to model the coherent flow of ac...