Adaptive workloads can change on–the–fly the configuration of their jobs, in terms of number of processes. To carry out these job reconfigurations, we have designed a methodology which enables a job to communicate with the resource manager and, through the runtime, to change its number of MPI ranks. The collaboration between both the workload manager—aware of the queue of jobs and the resources allocation—and the parallel runtime—able to transparently handle the processes and the program data—is crucial for our throughput-aware malleability methodology. Hence, when a job triggers a reconfiguration, the resource manager will check the cluster status and return the appropriate action: i) expand, if there are spare resources; ii) shrink, if qu...
Until recent years most parallel machines have been made up of closely-coupled microprocessor-based ...
International audienceIn large-scale distributed execution environments such as multicluster systems...
National audienceCurrent parallel architectures take advantage of new hardware evolution, like the u...
Adaptive workloads can change on–the–fly the configuration of their jobs, in terms of number of pro...
Process malleability has proved to have a highly positive impact on the resource utilization and glo...
Process malleability has proved to have a highly positive impact on the resource utilization and glo...
In this paper we introduce a methodology for dynamic job reconfiguration driven by the programming m...
Several studies have proved the benefits of job malleability, that is, the capacity of an applicatio...
In the design of future HPC systems, research in resource management is showing an increasing intere...
Part 4: Green Computing and Resource ManagementInternational audienceWe present a resource-aware sch...
In today’s batch queue HPC cluster systems, the user submits a job requesting a fixed number of...
Adaptive parallel applications that can change resources during execution, promise better system uti...
Abstract. In this paper, we describe DyRecT (Dynamic Reconfiguration Toolkit) a software library tha...
Scientific workflow management systems like Nextflow support large-scale data analysis by abstractin...
Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Perfor...
Until recent years most parallel machines have been made up of closely-coupled microprocessor-based ...
International audienceIn large-scale distributed execution environments such as multicluster systems...
National audienceCurrent parallel architectures take advantage of new hardware evolution, like the u...
Adaptive workloads can change on–the–fly the configuration of their jobs, in terms of number of pro...
Process malleability has proved to have a highly positive impact on the resource utilization and glo...
Process malleability has proved to have a highly positive impact on the resource utilization and glo...
In this paper we introduce a methodology for dynamic job reconfiguration driven by the programming m...
Several studies have proved the benefits of job malleability, that is, the capacity of an applicatio...
In the design of future HPC systems, research in resource management is showing an increasing intere...
Part 4: Green Computing and Resource ManagementInternational audienceWe present a resource-aware sch...
In today’s batch queue HPC cluster systems, the user submits a job requesting a fixed number of...
Adaptive parallel applications that can change resources during execution, promise better system uti...
Abstract. In this paper, we describe DyRecT (Dynamic Reconfiguration Toolkit) a software library tha...
Scientific workflow management systems like Nextflow support large-scale data analysis by abstractin...
Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Perfor...
Until recent years most parallel machines have been made up of closely-coupled microprocessor-based ...
International audienceIn large-scale distributed execution environments such as multicluster systems...
National audienceCurrent parallel architectures take advantage of new hardware evolution, like the u...