This article focuses on principles for the design of efficient parallel algorithms for distributed memory computing systems. We describe the general trend in the development of architectural properties and evaluate the state-of-the-art in a number of basic primitives like graph embedding, partitioning, dynamic load distribution, and communication which are used, to some extent, within all parallel applications. We discuss possible directions for future work on the design of universal basic primitives, able to perform efficiently on a broad range of parallel systems and applications, and we also give certain examples of specific applications which demand specialized basic primitives in order to obtain efficient parallel implementations. Fina...
We discuss some techniques for preserving locality of reference in index spaces when mapped to memor...
Load balancing in large parallel systems with distributed memory is a difficult task often influenci...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Parallel computation presents problems which are either nonexistent or trivial in the context of seq...
This paper describes the process used to extend the Boost Graph Library (BGL) for parallel operation...
Frames will provide support for the programming of distributed memory machines via a library of basi...
In this paper we want to demonstrate the large impact of theoretical considerations on the design an...
We present the design and implementation of a parallel algorithm for computing Gröbner bases on dist...
AbstractA scheme of an efficient general-purpose parallel computer is introduced. Its design space (...
AbstractThis paper introduces an architecture-independent, hierarchical approach to algorithm design...
. This session explores, through the use of formal methods, the "intuition" used in creati...
The objective of this thesis is the unified investigation of a wide range of fundament...
Parallel computing is increasingly exposed to the development and challengesof distributed systems, ...
this paper, we describe how the efficiency can be increased on distributed memory systems by exploit...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
We discuss some techniques for preserving locality of reference in index spaces when mapped to memor...
Load balancing in large parallel systems with distributed memory is a difficult task often influenci...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Parallel computation presents problems which are either nonexistent or trivial in the context of seq...
This paper describes the process used to extend the Boost Graph Library (BGL) for parallel operation...
Frames will provide support for the programming of distributed memory machines via a library of basi...
In this paper we want to demonstrate the large impact of theoretical considerations on the design an...
We present the design and implementation of a parallel algorithm for computing Gröbner bases on dist...
AbstractA scheme of an efficient general-purpose parallel computer is introduced. Its design space (...
AbstractThis paper introduces an architecture-independent, hierarchical approach to algorithm design...
. This session explores, through the use of formal methods, the "intuition" used in creati...
The objective of this thesis is the unified investigation of a wide range of fundament...
Parallel computing is increasingly exposed to the development and challengesof distributed systems, ...
this paper, we describe how the efficiency can be increased on distributed memory systems by exploit...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
We discuss some techniques for preserving locality of reference in index spaces when mapped to memor...
Load balancing in large parallel systems with distributed memory is a difficult task often influenci...
This paper describes a number of optimizations that can be used to support the efficient execution o...