We define a set of overhead functions that capture the salient artifacts representing the interaction between parallel application characteristics and architectural features. An execution-driven simulation testbed is used to separate these overheads in a parallel system. Using this testbed and a set of applications, we address two important issues. The first concerns the use of machine abstractions for performance studies of parallel systems. The second deals with quantifying the impact of locality on the performance of applications. The key conclusions from this study are that the newly proposed model LogP is an effective one for abstracting the network, and that ignoringlocality can significantly affect the application performance
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
The exploitation of locality of reference in shared memory multiprocessors is one of the most import...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
We define a set of overhead functions that capture the salient artifacts representing the interact...
Data locality is a well-recognized requirement for the development of any parallel application, but ...
Abstract. We present locality-based abstractions, in which a set of states of a distributed system i...
It is often assumed that computational load balance cannot be achieved in parallel and distributed s...
The exploitation of locality of reference in shared memory multiprocessors is one of the most import...
Abstract—this paper studies the influence that task placement may have on the performance of applica...
Scalability studies of parallel architectures have used scalar metrics to evaluate their performance...
. This paper studies the locality analysis problem for sharedmemory multiprocessors, a class of para...
Traditionally, in distributed memory architectures, locality maintenance and load balancing are seen...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
Thesis (Ph. D.)--University of Rochester. Department of Computer Science, 2017On modern processors, ...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
The exploitation of locality of reference in shared memory multiprocessors is one of the most import...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
We define a set of overhead functions that capture the salient artifacts representing the interact...
Data locality is a well-recognized requirement for the development of any parallel application, but ...
Abstract. We present locality-based abstractions, in which a set of states of a distributed system i...
It is often assumed that computational load balance cannot be achieved in parallel and distributed s...
The exploitation of locality of reference in shared memory multiprocessors is one of the most import...
Abstract—this paper studies the influence that task placement may have on the performance of applica...
Scalability studies of parallel architectures have used scalar metrics to evaluate their performance...
. This paper studies the locality analysis problem for sharedmemory multiprocessors, a class of para...
Traditionally, in distributed memory architectures, locality maintenance and load balancing are seen...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
Thesis (Ph. D.)--University of Rochester. Department of Computer Science, 2017On modern processors, ...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
The exploitation of locality of reference in shared memory multiprocessors is one of the most import...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...