We define a set of overhead functions that capture the salient artifacts of the interaction between parallel application characteristics and architectural features. An execution-driven simulation testbed is used to separate these overheads in a parallel system. Using this testbed and a set of applications, we address two important issues. The first concerns the use of machine abstractions for performance studies of parallel systems. The second concerns quantifying the impact of locality on application performance. The key conclusions from this study are that the newly proposed LogP model is effective for abstracting the network, and that ignoring locality can significantly affect the application...
We argue in this paper that benchmarking should be complemented by direct measurement of par...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
Scalability studies of parallel architectures have used scalar metrics to evaluate their performance...
Data locality is a well-recognized requirement for the development of any parallel application, but ...
This paper studies the influence that task placement may have on the performance of applica...
In this paper we present a new approach to benchmark the performance of shared memory systems. Thi...
The overheads in a parallel system that limit its scalability need to be identified and separated in...
The exploitation of locality of reference in shared memory multiprocessors is one of the most import...
Thesis (Ph.D.), University of Rochester, Department of Computer Science, 2017. On modern processors, ...
The task parallel programming model allows programmers to express concurrency at a high level of abs...
It is often assumed that computational load balance cannot be achieved in parallel and distributed s...
Scalability studies of parallel architectures have used scalar metrics to evaluate their performan...
We present locality-based abstractions, in which a set of states of a distributed system i...