In the design of future HPC systems, research in resource management is showing increasing interest in more dynamic control of the available resources. It has been shown that enabling jobs to change their number of computing resources at run time, i.e. making them malleable, can significantly improve HPC system performance. However, job schedulers and applications typically do not support malleability because of the common belief that it introduces additional programming complexity and performance impact. This paper presents DROM, an interface that provides efficient malleability with no effort for program developers. The running application is enabled to adapt the number of threads to the number of assigned computing resources in a comple...
Traditionally, High Performance Computing (HPC) and Data Intensive (DI) workloads have been executed...
This work presents an HPC framework that provides new strategies for resource management and job sche...
The Resource and Job Management System (RJMS) is a crucial system software par...
In the design of future HPC systems, research in resource management is showing an increasing intere...
In job scheduling, the concept of malleability has been explored for many years. Research show...
In recent years, high-performance computing research has become essential in pushing the boundaries of w...
In this paper we introduce a methodology for dynamic job reconfiguration driven by the programming m...
Process malleability has proved to have a highly positive impact on the resource utilization and glo...
Adaptive workloads can change on-the-fly the configuration of their jobs, in terms of number of pro...
Several studies have proved the benefits of job malleability, that is, the capacity of an applicatio...
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
In large-scale distributed execution environments such as multicluster systems...
The adoption of graphics processing units (GPUs) in high-performance computing (HPC) infrastructures de...
Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Perfor...
Process malleability has proved to have a highly positive impact on the resource utilization and glo...