The proliferation of cloud computing allows scientists to deploy computation- and data-intensive applications without infrastructure investment, and the large datasets these applications generate can be flexibly stored with multiple cloud service providers. Under the pay-as-you-go model, the total application cost depends largely on the usage of computation, storage and bandwidth resources, so cutting the cost of cloud-based data storage becomes a major concern when deploying scientific applications in the cloud. In this paper, we propose a novel algorithm that automatically decides whether a generated dataset should be 1) stored in the current cloud, 2) deleted and re-generated whenever it is reused, or 3) transferred to a cheaper cloud service for storage. The algor...
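The three-way choice named in the abstract (store, delete and re-generate, or transfer) can be illustrated with a simple cost-rate comparison. The sketch below is not the paper's algorithm, whose details are cut off above. It assumes a hypothetical per-month cost model for a single dataset with no dependencies on other datasets, and all names (`Dataset`, `cheapest_strategy`, the price parameters) are invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    size_gb: float         # size of the generated dataset
    regen_cost: float      # assumed computation cost ($) to re-generate it once
    uses_per_month: float  # assumed expected number of reuses per month


def cheapest_strategy(d, local_price, remote_price, transfer_price):
    """Pick the cheapest of three strategies under an assumed monthly cost model.

    local_price / remote_price: storage price in $ per GB per month.
    transfer_price: bandwidth price in $ per GB, paid on every reuse
    when the dataset is kept with the cheaper remote provider.
    """
    costs = {
        # 1) keep the dataset in the current cloud
        "store": d.size_gb * local_price,
        # 2) delete it and pay for computation on every reuse
        "regenerate": d.regen_cost * d.uses_per_month,
        # 3) keep it with a cheaper provider and pay bandwidth on every reuse
        "transfer": d.size_gb * remote_price
        + d.size_gb * transfer_price * d.uses_per_month,
    }
    best = min(costs, key=costs.get)
    return best, costs[best]


if __name__ == "__main__":
    rarely_used = Dataset(size_gb=100, regen_cost=5.0, uses_per_month=0.1)
    print(cheapest_strategy(rarely_used, 0.10, 0.02, 0.05))
```

Under this assumed model, a dataset that is cheap to re-generate and rarely reused favors deletion, a frequently reused one favors local storage, and a large, expensive-to-compute, rarely reused one favors the cheaper provider.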
Many scientific workflows are data intensive: large volumes of intermediate datasets are generated d...
Many scientific workflows are data intensive where a large volume of intermediate data is generated ...
Cloud computing provides access to a large scale set of readily available computing resources at the...
Nowadays, scientific research increasingly relies on IT technologies, where large-scale and high per...
The proliferation of cloud computing allows users to flexibly store, re-compute or transfer large ge...
Computation and Storage in the Cloud is the first comprehensive and systematic work investigating th...
Massive computation power and storage capacity of cloud computing systems allow scientists to deploy...
Scientific applications are usually data intensive [1, 2], where the generated datasets are often t...
Abstract—Massive computation power and storage capacity of cloud computing systems enable users to e...
Massive computation power and storage capacity of cloud computing systems allow scientists to deploy...
Abstract — Massive computation power and storage capacity of cloud computing systems allow scientist...
Abstract—Massive computation power and storage capacity of cloud computing systems allow scientists ...
Massive computation power and storage capacity of cloud computing systems enable users to either sto...