The Data Mining Cloud Framework (DMCF) is an environment for designing and executing data analysis workflows on cloud platforms. Currently, DMCF relies on the default storage of the public cloud provider for all I/O operations, so its I/O performance is bounded by the performance of that storage. In this work we propose using the Hercules system within DMCF as an ad-hoc storage system for temporary data produced inside workflow-based applications. Hercules is a highly scalable, easy-to-deploy distributed in-memory storage system. The proposed solution takes advantage of the scalability of Hercules to avoid the bandwidth limits of the default storage. Early experimental results are ...
In the last decade, data processing systems started using main memory as much as poss...
Emerging scientific workflows in high performance computing (HPC) focus more on analysis rather than...
The widespread popularity of Cloud computing as a preferred platform for the deployment of web applica...
Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016...
Proceedings of: First International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2014...
Cloud computing systems provide scalable infrastructure to store and process Big Data genera...
This contribution reports on the feasibility of executing data intensive workflows on Cloud infrastr...
The ever-increasing power of supercomputer systems is both driving and enabling the emergence of new...
The cloud is evolving due to additional demands introduced by new technological advancements and the...
In this report we address the problem of data management in clouds for the MapReduc...
Over the next few years, the LHC will prepare for the upcoming High-Luminosity upgrade in which it i...
The computing frameworks running in the cloud environment at an extreme scale provide efficient and ...