In scientific cloud workflows, large amounts of application data need to be stored in distributed data centres. To effectively store these data, a data manager must intelligently select data centres in which these data will reside. This is, however, not the case for data which must have a fixed location. When one task needs several datasets located in different data centres, the movement of large volumes of data becomes a challenge. In this paper, we propose a matrix based k-means clustering strategy for data placement in scientific cloud workflows. The strategy contains two algorithms that group the existing datasets in k data centres during the workflow build-time stage, and dynamically clusters newly generated datasets to the most approp...
International audienceWith the evolution of geographically distributed data centers in the Cloud Com...
International audienceLarge-scale, data-intensive scientific applications are often expressed as sci...
Abstract — Scientific applications often perform complex computational analyses that consume and pro...
Scientific workflow is a complicated data intensive application. How to achieve an effective data pl...
International audienceWe consider the problem of optimizing the execution of data-intensive scientif...
Abstract — In this new era of Big Data, there is a growing need to enable scientific workflows to pe...
Due to the advantages of cost-effectiveness, on-demand resource provision and easy for sharing, clou...
Cloud computing emerges with high performance computing, massive data storage and easy access of the...
Recently, cloud computing has emerged as a promising computing infrastructure for performing scienti...
Due to its advantages of cost-effectiveness, on-demand provisioning and easy for sharing, cloud comp...
Scientific workflows benefit from the cloud computing paradigm, which offers access to virtual resou...
The data requirements of both scientific and commercial applications have been increasing drasticall...
Nowadays, more and more scientific experiments need to handle massive amounts of data. Their data pr...
Data-intensive scientific applications are posing many challenges in distributed computing systems. ...
International audienceThe current solutions for the parallel execution of scientific workflows are a...
International audienceWith the evolution of geographically distributed data centers in the Cloud Com...
International audienceLarge-scale, data-intensive scientific applications are often expressed as sci...
Abstract — Scientific applications often perform complex computational analyses that consume and pro...
Scientific workflow is a complicated data intensive application. How to achieve an effective data pl...
International audienceWe consider the problem of optimizing the execution of data-intensive scientif...
Abstract — In this new era of Big Data, there is a growing need to enable scientific workflows to pe...
Due to the advantages of cost-effectiveness, on-demand resource provision and easy for sharing, clou...
Cloud computing emerges with high performance computing, massive data storage and easy access of the...
Recently, cloud computing has emerged as a promising computing infrastructure for performing scienti...
Due to its advantages of cost-effectiveness, on-demand provisioning and easy for sharing, cloud comp...
Scientific workflows benefit from the cloud computing paradigm, which offers access to virtual resou...
The data requirements of both scientific and commercial applications have been increasing drasticall...
Nowadays, more and more scientific experiments need to handle massive amounts of data. Their data pr...
Data-intensive scientific applications are posing many challenges in distributed computing systems. ...
International audienceThe current solutions for the parallel execution of scientific workflows are a...
International audienceWith the evolution of geographically distributed data centers in the Cloud Com...
International audienceLarge-scale, data-intensive scientific applications are often expressed as sci...
Abstract — Scientific applications often perform complex computational analyses that consume and pro...