Abstract — Scientific applications often perform complex computational analyses that consume and produce large data sets. We are concerned with data placement policies that distribute data in ways that are advantageous for application execution, for example, by placing data sets so that they may be staged into or out of computations efficiently or by replicating them for improved performance and reliability. In particular, we propose to study the relationship between data placement services and workflow management systems. In this paper, we explore the interactions between two services used in large-scale science today. We evaluate the benefits of prestaging data using the Data Replication Service versus using the native data stage-in mecha...
Scientists today are exploring the use of new tools and computing platforms to do their science. The...
Empirical thesis.Bibliography: pages 75-76.1. Introduction -- 2. Background and related work -- 3. D...
Scientific applications on the Grid are in most cases heavily data-dependent. Therefore, improving s...
Abstract—As scientific applications generate and consume data at ever-increasing rates, scientific w...
Abstract — In this new era of Big Data, there is a growing need to enable scientific workflows to pe...
2012-11-21Scientific domains are increasingly adopting workflow systems to automate and manage large...
UnrestrictedIn recent years, scientific communities have increasingly adopted computational workflow...
Advanced scientific workflows running at extreme scale on high end computing platforms are providing...
In scientific cloud workflows, large amounts of application data need to be stored in distributed da...
The data requirements of both scientific and commercial applications have been increasing drasticall...
In this paper we examine the issue of optimizing disk usage and scheduling large-scale scientific wo...
Abstract. In recent years, there has been a growing demand on the required resources in terms of com...
The scale of scientific applications becomes increasingly large not only in computation, but also in...
This paper describes the Pegasus framework that can be used to map complex scientific workflows onto...
This paper describes the Pegasus framework that can be used to map complex scientific workflows onto...
Scientists today are exploring the use of new tools and computing platforms to do their science. The...
Empirical thesis.Bibliography: pages 75-76.1. Introduction -- 2. Background and related work -- 3. D...
Scientific applications on the Grid are in most cases heavily data-dependent. Therefore, improving s...
Abstract—As scientific applications generate and consume data at ever-increasing rates, scientific w...
Abstract — In this new era of Big Data, there is a growing need to enable scientific workflows to pe...
2012-11-21Scientific domains are increasingly adopting workflow systems to automate and manage large...
UnrestrictedIn recent years, scientific communities have increasingly adopted computational workflow...
Advanced scientific workflows running at extreme scale on high end computing platforms are providing...
In scientific cloud workflows, large amounts of application data need to be stored in distributed da...
The data requirements of both scientific and commercial applications have been increasing drasticall...
In this paper we examine the issue of optimizing disk usage and scheduling large-scale scientific wo...
Abstract. In recent years, there has been a growing demand on the required resources in terms of com...
The scale of scientific applications becomes increasingly large not only in computation, but also in...
This paper describes the Pegasus framework that can be used to map complex scientific workflows onto...
This paper describes the Pegasus framework that can be used to map complex scientific workflows onto...
Scientists today are exploring the use of new tools and computing platforms to do their science. The...
Empirical thesis.Bibliography: pages 75-76.1. Introduction -- 2. Background and related work -- 3. D...
Scientific applications on the Grid are in most cases heavily data-dependent. Therefore, improving s...