The recent advent of parallel machines with tens of thousands of processors is pre-senting new challenges for obtaining scalability. A particular challenge for large-scale scientific software is determining the inter-processor communications required by the computation when a global description of the data is unavailable or too costly to store. We present a type of rendezvous algorithm that determines communication partners in a scalable manner by assuming the global distribution of the data. We demonstrate the scaling properties of the algorithm on up to 32,000 processors in the context of determining communication patterns for a matrix-vector multiply in the hypre software library. Our algorithm is very general and is applicable to a vari...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
International audienceWe propose a directed hypergraph model and a refinement heuristic to distribut...
Ali Pinary and Bruce Hendricksonz 1 Introduction In this work we address two problems associated wit...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
In this book chapter, the authors discuss some important communication issues to obtain a highly sca...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
In irregular all-to-all communication, messages are exchanged between every pair of processors. The ...
In this paper we propose a new approach to the study of the communication requirements of distribute...
Abstract-Consider a message-passing system of n processors, in which each processor holds one piece ...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
The current trends in high performance computing show that large machines with tens of thousands of ...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
International audienceWe propose a directed hypergraph model and a refinement heuristic to distribut...
Ali Pinary and Bruce Hendricksonz 1 Introduction In this work we address two problems associated wit...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
In this book chapter, the authors discuss some important communication issues to obtain a highly sca...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
In irregular all-to-all communication, messages are exchanged between every pair of processors. The ...
In this paper we propose a new approach to the study of the communication requirements of distribute...
Abstract-Consider a message-passing system of n processors, in which each processor holds one piece ...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
The current trends in high performance computing show that large machines with tens of thousands of ...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
International audienceWe propose a directed hypergraph model and a refinement heuristic to distribut...