In this paper we present a new approach to benchmark the performance of shared memory systems. This approach focuses on recognizing how far off is the performance of a given memory system from a realistic ideal parallel machine. We define such a realistic machine model called the z-machine, which accounts for the inherent communication costs in an application by tracking the data flow in the application. The z-machine is incorporated into an execution-driven simulation framework to be used as a reference for benchmarking for different memory systems. The components of the overheads in these memory systems are identified and quantified for four applications. Using the z-machine performance as the standard to strive for we discu...
Parallel graph reduction is a conceptually simple model for the concurrent evaluation of lazy functi...
Distributed shared-memory systems provide scalable performance and a convenient model for parallel p...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
In this paper we present a new approach to benchmark the performance of shared memory systems. This ...
In this paper we present a new approach to benchmark the performance of shared memory systems. This ...
We address the gap between structured parallel programming and parallel architectures by formalizing...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...
The overheads in a parallel system that limit its scalability need to be identified and separated in...
We define a set of overhead functions that capture the salient artifacts representing the interact...
AbstractWe argue in this paper that benchmarking should be complemented by direct measurement of par...
Scalability studies of parallel architectures have used scalar metrics to evaluate their performance...
Parallel computing is essential for solving very large scientific and engineering problems. An effec...
This thesis presents a unified approach to modeling of parallel architectures and algorithms with sp...
Thinking Machines\u27 CM-5 machine is a distributed-memory, message-passing computer. In this paper ...
Prediction of the performance of parallel applications is a concept useful in several domains of sof...
Parallel graph reduction is a conceptually simple model for the concurrent evaluation of lazy functi...
Distributed shared-memory systems provide scalable performance and a convenient model for parallel p...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
In this paper we present a new approach to benchmark the performance of shared memory systems. This ...
In this paper we present a new approach to benchmark the performance of shared memory systems. This ...
We address the gap between structured parallel programming and parallel architectures by formalizing...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...
The overheads in a parallel system that limit its scalability need to be identified and separated in...
We define a set of overhead functions that capture the salient artifacts representing the interact...
AbstractWe argue in this paper that benchmarking should be complemented by direct measurement of par...
Scalability studies of parallel architectures have used scalar metrics to evaluate their performance...
Parallel computing is essential for solving very large scientific and engineering problems. An effec...
This thesis presents a unified approach to modeling of parallel architectures and algorithms with sp...
Thinking Machines\u27 CM-5 machine is a distributed-memory, message-passing computer. In this paper ...
Prediction of the performance of parallel applications is a concept useful in several domains of sof...
Parallel graph reduction is a conceptually simple model for the concurrent evaluation of lazy functi...
Distributed shared-memory systems provide scalable performance and a convenient model for parallel p...
While parallel computing offers an attractive perspective for the future, developing efficient paral...