ABSTRACT In software-oriented multithread-ing execution model, the compiler identifies the re-mote accesses and performs fast context switches t,o hide remote =cess latency. In TAM model of execu-tion, threads access the local memory through a data structure called “frame”. This paper introduces a cache memory for frame structure and applies two techniques to reduce the cache miss ratio. One is a frame prefetching, which is based on the frame scheduling information, and the other is a changing frame execution sequences by the working frame set concept. Multithreading simulation is performed us-ing benchmark programs and causes of cache misses are classified and analyzed. This paper shows the promising result that the frame prefetching based...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
This paper presents new analytical models of the performance be-nefits of multithreading and prefetc...
grantor: University of TorontoThe latency of accessing instructions and data from the memo...
Multithreading techniques used within computer processors aim to provide the computer system with ...
Multithreading can be used to hide latency in a non-blocking cache architecture. By switching execut...
This paper describes a method to improve the cache locality of sequential programs by scheduling fin...
Pre-execution attacks cache misses for which conventional address-prediction driven prefetching is i...
[[abstract]]©1998 JISE-A multithreaded computer maintains multiple program counters and register fil...
The speculated execution of threads in a multithreaded architecture plus the branch prediction used ...
Since long latency due to remote memory access could be tolerated by rapidly switching to another th...
Shared memory multiprocessors are considered among the easiest parallel computers to program. Howeve...
This is a presentation of initial ideas on techniques that can be used in order to achieve a predict...
Modern processors apply sophisticated techniques, such as deep cache hierarchies and hardware prefet...
Cache misses represent a major bottleneck in embedded systems performance. Traditionally, compilers ...
Once the cache memory was introduced in computer systems, the well-known gap in speeds between the m...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
This paper presents new analytical models of the performance be-nefits of multithreading and prefetc...
grantor: University of TorontoThe latency of accessing instructions and data from the memo...
Multithreading techniques used within computer processors aim to provide the computer system with ...
Multithreading can be used to hide latency in a non-blocking cache architecture. By switching execut...
This paper describes a method to improve the cache locality of sequential programs by scheduling fin...
Pre-execution attacks cache misses for which conventional address-prediction driven prefetching is i...
[[abstract]]©1998 JISE-A multithreaded computer maintains multiple program counters and register fil...
The speculated execution of threads in a multithreaded architecture plus the branch prediction used ...
Since long latency due to remote memory access could be tolerated by rapidly switching to another th...
Shared memory multiprocessors are considered among the easiest parallel computers to program. Howeve...
This is a presentation of initial ideas on techniques that can be used in order to achieve a predict...
Modern processors apply sophisticated techniques, such as deep cache hierarchies and hardware prefet...
Cache misses represent a major bottleneck in embedded systems performance. Traditionally, compilers ...
Once the cache memory was introduced in computer systems, the well-known gap in speeds between the m...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
This paper presents new analytical models of the performance be-nefits of multithreading and prefetc...
grantor: University of TorontoThe latency of accessing instructions and data from the memo...