The emergence of accelerators as standard computing resources on supercomputers and the subsequent architectural complexity increase revived the need for high-level parallel programming paradigms. Sequential task-based programming model has been shown to efficiently meet this challenge on a single multicore node possibly enhanced with accelerators, which motivated its support in the OpenMP 4.0 standard. In this paper, we show that this paradigm can also be employed to achieve high performance on modern supercomputers composed of multiple such nodes, with extremely limited changes in the user code. To prove this claim, we have extended the StarPU runtime system with an advanced inter-node data management layer that supports this model by pos...
Heterogeneous architectures have been widely used in the domain of high performance computing. Howev...
Article publié à Techniques et Sciences Informatiques Volume 28. n° 1/2009National audienceL'ordonna...
The significant increase of the hardware complexity that occurred in the last few years led the high...
Most high-performance, scientific libraries have adopted hybrid parallelization schemes - such as t...
Task-based systems have gained popularity because of their promise of exploiting the computational p...
Task-based models and runtimes are quite popular in the HPC community. Theyhelp to implement applica...
With the emergence of highly heterogeneous, dynamic and large distributed platforms, declarative pro...
This paper deals with the efficient combination of software components and task-based models for HPC...
With the advent of complex modern architectures, the low-levelparadigms long considered sufficient t...
High performance \FMM is crucial for the numerical simulation of many physical problems. In a previo...
In this paper, we describe and evaluate an extension of the Chameleon library to operate with hierar...
A now-classical way of meeting the increasing demand for computing speed by HPC applications is the ...
This thesis intends to show how to efficiently exploit the parallelism present in applications in or...
Parallel programs need to manage the time trade-off between synchronization and computation. A high ...
As single processing unit performance has reached a technological limit, the power wall, the past de...
Heterogeneous architectures have been widely used in the domain of high performance computing. Howev...
Article publié à Techniques et Sciences Informatiques Volume 28. n° 1/2009National audienceL'ordonna...
The significant increase of the hardware complexity that occurred in the last few years led the high...
Most high-performance, scientific libraries have adopted hybrid parallelization schemes - such as t...
Task-based systems have gained popularity because of their promise of exploiting the computational p...
Task-based models and runtimes are quite popular in the HPC community. Theyhelp to implement applica...
With the emergence of highly heterogeneous, dynamic and large distributed platforms, declarative pro...
This paper deals with the efficient combination of software components and task-based models for HPC...
With the advent of complex modern architectures, the low-levelparadigms long considered sufficient t...
High performance \FMM is crucial for the numerical simulation of many physical problems. In a previo...
In this paper, we describe and evaluate an extension of the Chameleon library to operate with hierar...
A now-classical way of meeting the increasing demand for computing speed by HPC applications is the ...
This thesis intends to show how to efficiently exploit the parallelism present in applications in or...
Parallel programs need to manage the time trade-off between synchronization and computation. A high ...
As single processing unit performance has reached a technological limit, the power wall, the past de...
Heterogeneous architectures have been widely used in the domain of high performance computing. Howev...
Article publié à Techniques et Sciences Informatiques Volume 28. n° 1/2009National audienceL'ordonna...
The significant increase of the hardware complexity that occurred in the last few years led the high...