This dissertation considers the use of data prefetching and an alternative mechanism, data forwarding, for reducing memory latency due to interprocessor communication in cache-coherent, shared-memory multiprocessors. The benefits of prefetching and forwarding are considered for large numerical application codes with loop-level and vector parallelism. Data prefetching is applied to these applications using two different multiprocessor prefetching algorithms implemented within a parallelizing compiler. Data forwarding considers array references involved in communication-related accesses between successive parallel loops, rather than within a single loop nest. A hybrid prefetching and forwarding scheme and a compiler algorithm for data forwar...
Shared-memory multiprocessors are becoming increasingly popular as a high-performance, easy to progr...
Despite rapid increases in CPU performance, the primary obstacles to achieving higher performance in...
While many parallel applications exhibit good spatial locality, other important codes in areas like ...
As the difference in speed between processor and memory system continues to increase, it is...
Memory latency has always been a major issue in shared-memory multiprocessors and high-speed systems...
Prefetching, i.e., exploiting the overlap of processor computations with data accesses, is one of s...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
The gap between processor speed and memory latency has led to the use of caches in the memory system...
Memory latency is becoming an increasingly important performance bottleneck as the gap between processor ...
This thesis considers two approaches to the design of high-performance computers. In a single proces...
Data prefetching is an effective data access latency hiding technique to mask the CPU stall...
This document describes a set of new techniques for improving the efficiency of compiler-directed so...
A well-known performance bottleneck in computer architecture is the so-called memory wall. This term...
grantor: University of Toronto. The latency of accessing instructions and data from the memo...