Abstract – We present a new approach to benchmarking the performance of shared memory systems. The approach measures how far the performance of a given memory system falls from that of a realistic ideal parallel machine. We define such a machine model, the z-machine, which accounts for the communication costs inherent in an application by tracking the application's data flow. The z-machine is incorporated into an execution-driven simulation framework and serves as the reference for benchmarking different memory systems. The components of the overheads in these memory systems are identified and quantified for four applications. Using the z-machine performance as the standard to strive for, we discuss the...
Prediction of the performance of parallel applications is a concept useful in several domains of sof...
The goal of this work was to examine existing shared memory parallel programming models, figure out ...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
The CPUs, memory, interconnection network, operating system, runtime system, I/O subsystem, and appl...
Scalability studies of parallel architectures have used scalar metrics to evaluate their performance...
The overheads in a parallel system that limit its scalability need to be identified and separated in...
This paper develops and validates an analytical model for evaluating various types of architectural ...
This thesis presents a unified approach to modeling of parallel architectures and algorithms with sp...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...
Parallel computing is essential for solving very large scientific and engineering problems. An effec...
Shared memory models have been criticized for years for failing to model essential realities of para...
We address the gap between structured parallel programming and parallel architectures by formalizing...
Parallel graph reduction is a conceptually simple model for the concurrent evaluation of lazy functi...