This paper describes dstep, a directive-based programming model for hybrid shared and distributed memory machines. The originality of our work lies in the definition and implementation of a unified high-level programming model that addresses both data and computation distribution and gives particularly fine control over the computation. The goal is to improve programmer productivity while providing good performance in terms of execution time and memory usage. We define a generic compilation scheme for computation mapping and communication generation. We implement the solution in a source-to-source compiler together with a runtime library. We provide a series of optimizations to improve the performance of the generated...
Massively parallel MIMD distributed-memory machines can provide enormous computational power; howeve...
Power consumption and fabrication limitations are increasingly playing significant roles in the desi...
The shared-memory programming model can be an effective way to achieve parallelism on shared memory ...
In recent years, distributed memory parallel machines have been widely recognized as the most likely...
Shared-memory multiprocessor systems can achieve high performance levels when appropriate work paral...
Distributed Memory Multicomputers (DMMs) such as the IBM SP-2, the Intel Paragon and the Thinking Ma...
Over the past few decades, scientific research has grown to rely increasingly on simulation and othe...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
This paper describes the design and implementation of a scalable run-time system and an optimizing c...
Current multicomputers are typically built as interconnected clusters of shared...
With the end of Dennard scaling, future high performance computers are expected to consist of distri...
Advances in computing and networking infrastructure have enabled an increasing number of application...
With the advent of Distributed Memory Machines (DMMs), numerous works have been undertaken to ease the...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1/SP-2...
In high-performance computing, hybrid systems are defined as architectures where shared and distribu...