Many important scientific applications do not fit the tradi-tional model of a monolithic simulation running on thou-sands of nodes. Scientific workflows – such as the Materi-als Genome project, Energy Frontiers Research Center for Gas Separations Relevant to Clean Energy Technologies, cli-mate simulations, and Uncertainty Quantification in fluid and solid dynamics – all run large numbers of parallel anal-yses, which we call scientific ensembles. These scientific ensembles have a large number of tasks with control and data dependencies. Current tools for creating and manag-ing these ensembles in HPC environments are limited and difficult to use; this is proving to be a limiting factor to run-ning scientific ensembles at the large scale enabl...
Monte Carlo simulations employed for the analysis of portfo-lios of catastrophic risk process large ...
Abstract — Applying high level parallel runtimes to data/compute intensive applications is becoming ...
— In this paper, we discuss on the VELaSSCo project (Visualization for Extremely LArge-Scale Scienti...
The ever-increasing volumes of scientific data com- bined with sophisticated techniques for extracti...
In the last decade, our ability to store data has grown at a greater rate than our ability to proces...
As the data growth rate outpace that of the processing capabilities of CPUs, reaching Petascale, tec...
In this work we present an scientific application that has been given a Hadoop MapReduce implementat...
Cloud Computing has gained a lot of popularity in recent years because of the flexibility that it of...
We propose a new ensemble algorithm: the meta-boosting algorithm. This algorithm enables the origina...
MapReduce, the popular programming paradigm for large-scale data processing, has traditionally been ...
The computing power of modern high performance systems cannot be fully exploited using traditional p...
Abstract—MapReduce is a powerful data processing platform for commercial and academic applications. ...
Large quantities of data have been generated from multiple sources at exponential rates in the last ...
The Apache Hadoop framework has rung in a new era in how data-rich organizations can process, store,...
Abstract — Cloud Computing is emerging as a new computational paradigm shift.Hadoop MapReduce has be...
Monte Carlo simulations employed for the analysis of portfo-lios of catastrophic risk process large ...
Abstract — Applying high level parallel runtimes to data/compute intensive applications is becoming ...
— In this paper, we discuss on the VELaSSCo project (Visualization for Extremely LArge-Scale Scienti...
The ever-increasing volumes of scientific data com- bined with sophisticated techniques for extracti...
In the last decade, our ability to store data has grown at a greater rate than our ability to proces...
As the data growth rate outpace that of the processing capabilities of CPUs, reaching Petascale, tec...
In this work we present an scientific application that has been given a Hadoop MapReduce implementat...
Cloud Computing has gained a lot of popularity in recent years because of the flexibility that it of...
We propose a new ensemble algorithm: the meta-boosting algorithm. This algorithm enables the origina...
MapReduce, the popular programming paradigm for large-scale data processing, has traditionally been ...
The computing power of modern high performance systems cannot be fully exploited using traditional p...
Abstract—MapReduce is a powerful data processing platform for commercial and academic applications. ...
Large quantities of data have been generated from multiple sources at exponential rates in the last ...
The Apache Hadoop framework has rung in a new era in how data-rich organizations can process, store,...
Abstract — Cloud Computing is emerging as a new computational paradigm shift.Hadoop MapReduce has be...
Monte Carlo simulations employed for the analysis of portfo-lios of catastrophic risk process large ...
Abstract — Applying high level parallel runtimes to data/compute intensive applications is becoming ...
— In this paper, we discuss on the VELaSSCo project (Visualization for Extremely LArge-Scale Scienti...