With their globally distributed datacenters, clouds now provide an opportunity to run complex large-scale applications on dynamically provisioned, networked and federated infrastructures. However, there is a lack of tools supporting data-intensive applications across geographically distributed sites. For instance, scientific workflows which handle many small files can easily saturate state-of-the-art distributed filesystems based on centralized metadata servers (e.g. HDFS, PVFS). In this paper, we explore several alternative design strategies to efficiently support the execution of existing workflow engines across multi-site clouds, by reducing the cost of metadata operations. These strategies leverage workflow semanti...
Large-scale, data-intensive scientific applications are often expressed as sci...
Large-scale scientific applications are often expressed as workflows that help...
The global deployment of cloud datacenters is enabling large scale scientific ...
By 2020, the digital universe is expected to reach 44 zettabytes, as it is doubling every two years....
The current solutions for the parallel execution of scientific workflows are a...