International audienceScientific insights in the coming decade will clearly depend on the effective processing of large datasets generated by dynamic heterogeneous applications typical of workflows in large data centers or of emerging fields like neuroscience. In this paper, we show how these big data workflows have a unique set of characteristics that pose challenges for leveraging HPC methodologies, particularly in scheduling. Our findings indicate that execution times for these workflows are highly unpredictable and are not correlated with the size of the dataset involved or the precise functions used in the analysis. We characterize this inherent variability and sketch the need for new scheduling approaches by quantifying significant ga...
This Thesis deals with the problem of scheduling applications on High-Performance Computing (HPC) ma...
Context: Hadoop, Spark, Storm, and Mesos are very well known frameworks in both research and industr...
International audienceWe propose a novel job scheduling approach for homogeneous cluster computing p...
International audienceWith the expected convergence between HPC, BigData and AI, new applications wi...
International audienceNew emerging fields are developing a growing number of large-scale application...
Many breakthroughs in scientific and industrial research are supported by simulations and calculatio...
The growth in data generated by private and public organizations leads to several opportunities to o...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
Traditional scheduling techniques are of a by-gone era and do not cater for the dynamism of new and ...
The scale of scientific applications becomes increasingly large not only in computation, but also in...
International audienceExecuting Big Data workloads upon High Performance Computing (HPC) infrastract...
Task graphs provide a simple way to describe scientific workflows (sets of tasks with dependencies) ...
In high-performance computing (HPC), workflow-based workloads are usually data intensive for explora...
This Thesis deals with the problem of scheduling applications on High-Performance Computing (HPC) ma...
Context: Hadoop, Spark, Storm, and Mesos are very well known frameworks in both research and industr...
International audienceWe propose a novel job scheduling approach for homogeneous cluster computing p...
International audienceWith the expected convergence between HPC, BigData and AI, new applications wi...
International audienceNew emerging fields are developing a growing number of large-scale application...
Many breakthroughs in scientific and industrial research are supported by simulations and calculatio...
The growth in data generated by private and public organizations leads to several opportunities to o...
International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplis...
Taufer, MichelaHigh performance computing (HPC) is undergoing many changes at both the system and wo...
Job scheduling in high-performance computing platforms is a hard problem that involves uncertainties...
Traditional scheduling techniques are of a by-gone era and do not cater for the dynamism of new and ...
The scale of scientific applications becomes increasingly large not only in computation, but also in...
International audienceExecuting Big Data workloads upon High Performance Computing (HPC) infrastract...
Task graphs provide a simple way to describe scientific workflows (sets of tasks with dependencies) ...
In high-performance computing (HPC), workflow-based workloads are usually data intensive for explora...
This Thesis deals with the problem of scheduling applications on High-Performance Computing (HPC) ma...
Context: Hadoop, Spark, Storm, and Mesos are very well known frameworks in both research and industr...
International audienceWe propose a novel job scheduling approach for homogeneous cluster computing p...