Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016) Timisoara, Romania. February 8-11, 2016.The main purpose of this thesis is to research in the relation between task-based programming models and resource management systems in order to provide a smart autonomous load-balancing and fault-tolerant system. Thus, taking advantage of MPI malleable applications and execution models such as SMPD and MPMD we will dig in the principle of the dynamical reconfiguration. Apart from providing an overview of the thesis idea, this paper explains our initial motivation and reviews briefly the most remarkable work done in this field.This work is partially supported by EU under the COST Program Action IC1305: ...
This paper presents a high level dynamic task allocation algorithm that is inspired by the biologica...
The Complexity of emerging multi/many-core architectures and diversity of modern workloads demands c...
Runtime resource management for many-core systems is increasingly complex.The complexity can be due ...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
In this paper we introduce a methodology for dynamic job reconfiguration driven by the programming m...
The use of distributed computing technology in real-time systems is rapidly increasing. Distributed ...
International audienceThe current static job scheduling on supercomputers for MPI-based applications...
International audienceJob management software on peta- and exascale supercomputers continues to prov...
The work in this paper focuses on providing malleability to MPI applications by using a novel perfor...
As new heterogeneous systems and hardware accelerators appear, high performance computers can reach ...
In a dynamic real-time environment, predictability needs to be provided in the face of unpredictable...
In this paper we describe the design of fault tolerance capabilities for general-purpose offload sem...
We present a new hierarchical approximation and scheduling approach for applications and tasks with ...
The Complexity of emerging multi/many-core architectures and diversity of modern workloads demands c...
This paper presents a high level dynamic task allocation algorithm that is inspired by the biologica...
The Complexity of emerging multi/many-core architectures and diversity of modern workloads demands c...
Runtime resource management for many-core systems is increasingly complex.The complexity can be due ...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
In this paper we introduce a methodology for dynamic job reconfiguration driven by the programming m...
The use of distributed computing technology in real-time systems is rapidly increasing. Distributed ...
International audienceThe current static job scheduling on supercomputers for MPI-based applications...
International audienceJob management software on peta- and exascale supercomputers continues to prov...
The work in this paper focuses on providing malleability to MPI applications by using a novel perfor...
As new heterogeneous systems and hardware accelerators appear, high performance computers can reach ...
In a dynamic real-time environment, predictability needs to be provided in the face of unpredictable...
In this paper we describe the design of fault tolerance capabilities for general-purpose offload sem...
We present a new hierarchical approximation and scheduling approach for applications and tasks with ...
The Complexity of emerging multi/many-core architectures and diversity of modern workloads demands c...
This paper presents a high level dynamic task allocation algorithm that is inspired by the biologica...
The Complexity of emerging multi/many-core architectures and diversity of modern workloads demands c...
Runtime resource management for many-core systems is increasingly complex.The complexity can be due ...