In order to improve performance, future parallel systems will continue to increase the processing power of each node. As node processors execute more instructions concurrently, however, they become more sensitive to first-level memory access latency. This paper presents a set of hardware and software techniques, collectively referred to as register preloading, to effectively tolerate long first-level memory access latency. The techniques include speculative execution, loop unrolling, dynamic memory disambiguation, and strip-mining. Results show that register preloading provides excellent tolerance of first-level memory access latencies of up to 16 cycles for a 4-issue node processor.
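For illustration only (this sketch is not taken from the paper), the following C fragment shows the basic idea behind register preloading combined with loop unrolling: the loads of an unrolled iteration are issued back-to-back into local variables (registers) before any of the arithmetic that consumes them, so independent work overlaps the first-level memory access latency. All function and variable names here are hypothetical.

/* Register preloading via 4-way loop unrolling (illustrative sketch). */
void scale(double *dst, const double *src, double k, int n)
{
    int i = 0;
    for (; i + 4 <= n; i += 4) {
        /* Preload: issue all four loads before the multiplies that use them. */
        double r0 = src[i];
        double r1 = src[i + 1];
        double r2 = src[i + 2];
        double r3 = src[i + 3];
        dst[i]     = r0 * k;
        dst[i + 1] = r1 * k;
        dst[i + 2] = r2 * k;
        dst[i + 3] = r3 * k;
    }
    /* Remainder loop for trip counts not divisible by 4. */
    for (; i < n; i++)
        dst[i] = src[i] * k;
}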
The large latency of memory accesses in modern computers is a key obstacle in achieving high process...
In this dissertation, we provide hardware solutions to increase the efficiency of the cache hierarch...
Around 2003, newly activated power constraints caused single-thread performance growth to slow drama...
By exploiting fine grain parallelism, superscalar processors can potentially increase the performanc...
In computer systems, latency tolerance is the use of concurrency to achieve high performance in spit...
As the gap between processor and memory speeds widens, program performance is increasingly dependent...
Processor design techniques, such as pipelining, superscalar, and VLIW, have dramatically decreased ...
Modern processors and compilers hide long memory latencies through non-blocking loads or explicit so...
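As a hedged illustration of the software side of this idea (assuming a GCC/Clang-style compiler; not taken from the cited work), the C sketch below issues an explicit prefetch hint a fixed distance ahead of the loop's loads so the data is likely to reside in the first-level cache by the time it is needed. PREFETCH_DIST is a hypothetical tuning parameter.

/* Explicit software prefetching (illustrative sketch). */
#define PREFETCH_DIST 16

double sum_array(const double *a, int n)
{
    double s = 0.0;
    for (int i = 0; i < n; i++) {
        /* Hint the hardware to fetch a[i + PREFETCH_DIST] for reading,
           with low temporal locality, before it is actually loaded. */
        if (i + PREFETCH_DIST < n)
            __builtin_prefetch(&a[i + PREFETCH_DIST], 0, 1);
        s += a[i];
    }
    return s;
}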
Journal Paper: Current microprocessors incorporate techniques to aggressively exploit instruction-leve...
Recent technological advances are such that the gap between processor cycle times and memory cycle t...
PhD Thesis: Current microprocessors improve performance by exploiting instruction-level parallelism (I...
Summarization: By examining the rate at which successive generations of processor and DRAM cycle tim...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...