We report on a project to develop a unified approach for building a library of collective communication operations that performs well on a cross-section of problems encountered in real applications. The target architecture is a two-dimensional mesh with wormhole routing, but the techniques are more general. The approach differs from traditional library implementations in that we address the need for implementations that perform well for vectors of various sizes and for various grid dimensions, including non-power-of-two grids. We show how a general approach to hybrid algorithms yields good performance across the entire range of vector lengths. Moreover, many scalable implementations of application libraries require collective communication within groups of no...
Partitioned Global Address Space (PGAS) languages offer programmers the convenience of a shared memo...
In this paper, we consider the communications involved in the execution of a complex application, de...
We discuss the design and high-performance implementation of collective communications operations on...
In this paper, we outline a unified approach for building a library of collective communication oper...
Most MPC networks use wormhole routing to reduce the effect of path length on communicat...
The significance of collective communication operations for scalable parallel systems has been wel...
Collective communication allows efficient communication and synchronization among a collection of pr...
Massively parallel computers (MPC) are characterized by the distribution of memory among an ensemble...
Technology trends suggest that future machines will rely on parallelism to meet increasing performan...
High Performance Computing (HPC) systems interconnect a large number of Processing Elements (PEs) in...
The placement of tasks in a parallel application on specific nodes of a supercomputer can s...
127 p. Thesis (Ph.D.), University of Illinois at Urbana-Champaign, 2005. In this thesis, we motivate t...
The efficient implementation of collective communication operations has received much attention. Initia...