Cache performance is critical in cache-based supercomputers, where the cache-miss/cache-hit memory reference delay ratio is typically large. Using compile-time analysis, program behavior can be predicted, and cache control directives can be embedded in the generated code. Thus, cache performance can be improved in a way not possible using conventional techniques. Given hardware able to selectively bypass the cache, cache performance can be increased because pollution can be minimized. Cache line replacement can also be controlled by the compiler (rather than by LRU, etc.), further enhancing performance. The research consists of the development of a model and algorithms providing optimal, or near optimal, cache performance by compiler manage...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer\u27s pr...
The performance of a traditional cache memory hierarchy can be improved by utilizing mechanisms such...
The potential of high-performance systems, especially parallel machines, is generally limited by the...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
Measurements of actual supercomputer cache performance has not been previously undertaken. PFC-Sim i...
Introduction As the microprocessor industry struggles to deliver higher performance superscalar and...
We present a novel, compile-time method for determining the cache performance of the loop nests in a...
The technological improvements in silicon manufacturing are yielding vast increases of processor &ap...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
Recent research results show that conventional hardware-only cache solutions result in unsatisfactor...
We address the problem of improving cache predictability and performance in embedded systems through...
The cache coherence maintenance problem has been the major obstacle in using private cache memory to...
In recent innovation particularly in the modern fields, the PCs are taken advantage of as controllin...
To reduce the average time needed to perform a read or a write access in a multiprocessor, a cache i...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer\u27s pr...
The performance of a traditional cache memory hierarchy can be improved by utilizing mechanisms such...
The potential of high-performance systems, especially parallel machines, is generally limited by the...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
Measurements of actual supercomputer cache performance has not been previously undertaken. PFC-Sim i...
Introduction As the microprocessor industry struggles to deliver higher performance superscalar and...
We present a novel, compile-time method for determining the cache performance of the loop nests in a...
The technological improvements in silicon manufacturing are yielding vast increases of processor &ap...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
Recent research results show that conventional hardware-only cache solutions result in unsatisfactor...
We address the problem of improving cache predictability and performance in embedded systems through...
The cache coherence maintenance problem has been the major obstacle in using private cache memory to...
In recent innovation particularly in the modern fields, the PCs are taken advantage of as controllin...
To reduce the average time needed to perform a read or a write access in a multiprocessor, a cache i...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer\u27s pr...
The performance of a traditional cache memory hierarchy can be improved by utilizing mechanisms such...
The potential of high-performance systems, especially parallel machines, is generally limited by the...