Programming for parallel architectures that do not have a shared address space is extremely difficult due to the need for explicit communication between the memories of different compute devices. Heterogeneous systems with CPUs and multiple GPUs, and distributed-memory clusters, are examples of such systems. Past works that try to automate data movement for distributed-memory architectures can lead to excessive redundant communication. In this paper, we propose an automatic data movement scheme that minimizes the volume of communication between compute devices in heterogeneous and distributed-memory systems. We show that by partitioning data dependences in a particular non-trivial way, one can generate data movement code that results in the min...
Current multicomputers are typically built as interconnected clusters of shared...
In distributed memory multicomputers, local memory accesses are much faster than those i...
Reducing communication overhead is extremely important in distributed-memory message-passing archite...
In this paper we concentrate on embedded parallel architectures with heterogen...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Advances in semiconductor technology enable multiple processor cores to be inte...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
We present new techniques for compilation of arbitrarily nested loops with affine dependences for di...
This paper describes dstep, a directive-based programming model for hybrid sha...
In this paper, we develop an automatic compile-time computation and data decomposition technique for...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
This paper presents a technique for finding good distributions of arrays and suitable loop restructu...
Distributed-memory message-passing machines deliver scalable performance but are difficult to progr...
We propose a set-theoretic model for parallelism. The model is based on separate distributio...