International audienceIn this paper we concentrate on embedded parallel architectures with heterogeneous memory management systems combining shared and local memories, and more precisely we focus on efficient data communications between the various architecture parts. We formulate explicit data transfers in a polyhedral context and give several strategies for managing efficient communications for redundantly stored/read data. This allows automatic DMA-style code generation for a variety of data mappings onto parallel processing elements. Our approach is validated on a wide series of data redistribution examples linked with a domain-specific parallelisation framework developed in Thales, SpearDE. We give the solution for efficient data trans...
Many computations can be structured as sets of communicating data-parallel tasks. Individual tasks m...
Data distribution is one of the key aspects that a parallelizing compiler for a distributed memory a...
This paper presents a technique for finding good distributions of arrays and suitable loop restructu...
Programming for parallel architectures that do not have a shared address space is extremely difficul...
This paper presents a compiling technique to generate parallel code with explicit local communicatio...
This paper presents a compiling technique to generate parallel code with explicit local communicatio...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
International audienceThis paper describes dstep, a directive-based programming model for hybrid sha...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
this paper we develop the constraints that the projection functions must satisfy, based on the infor...
International audienceIn this paper, we present original techniques for the generation and the effic...
The motivation of this thesis was to develop strategies that would enable unstructured mesh based co...
Producción CientíficaCurrent multicomputers are typically built as interconnected clusters of shared...
We report on the development of a new computational framework for efficiently carrying out parallel ...
An algorithm for mapping an arbitrary, multidimensional array onto an arbitrarily shaped multidimens...
Many computations can be structured as sets of communicating data-parallel tasks. Individual tasks m...
Data distribution is one of the key aspects that a parallelizing compiler for a distributed memory a...
This paper presents a technique for finding good distributions of arrays and suitable loop restructu...
Programming for parallel architectures that do not have a shared address space is extremely difficul...
This paper presents a compiling technique to generate parallel code with explicit local communicatio...
This paper presents a compiling technique to generate parallel code with explicit local communicatio...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
International audienceThis paper describes dstep, a directive-based programming model for hybrid sha...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
this paper we develop the constraints that the projection functions must satisfy, based on the infor...
International audienceIn this paper, we present original techniques for the generation and the effic...
The motivation of this thesis was to develop strategies that would enable unstructured mesh based co...
Producción CientíficaCurrent multicomputers are typically built as interconnected clusters of shared...
We report on the development of a new computational framework for efficiently carrying out parallel ...
An algorithm for mapping an arbitrary, multidimensional array onto an arbitrarily shaped multidimens...
Many computations can be structured as sets of communicating data-parallel tasks. Individual tasks m...
Data distribution is one of the key aspects that a parallelizing compiler for a distributed memory a...
This paper presents a technique for finding good distributions of arrays and suitable loop restructu...