Recent research results show that conventional hardware-only cache solutions result in unsatisfactory cache utilization for both regular and irregular applications. To overcome this problem, a number of architectures introduce instruction hints to assist cache replacement. For example, Intel Itanium architecture augments memory accessing instructions with cache hints to distinguish data that will be referenced in the near future from the rest. With the availability of such methods, the performance of the underlying cache architecture critically depends on the ability of the compiler to generate code with appropriate cache hints. In this paper we formulate this problem – giving cache hints to memory instructions such that cache miss rate is...
We present a novel, compile-time method for determining the cache performance of the loop nests in a...
Cache memories in embedded systems play an important role in reducing the execution time of the appl...
grantor: University of TorontoThe latency of accessing instructions and data from the memo...
Limited set-associativity in hardware caches can cause conflict misses when multiple data items map ...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
The technological improvements in silicon manufacturing are yielding vast increases of processor &ap...
The performance of a traditional cache memory hierarchy can be improved by utilizing mechanisms such...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
The widening gap between processor and memory speeds renders data locality optimization a very impor...
Limited set-associativity in hardware caches can cause conflict misses when multiple data items map ...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
Cache performance is critical in cache-based supercomputers, where the cache-miss/cache-hit memory r...
Abstract — Data cache in embedded systems plays the roles of both speeding up program execution and ...
We address the problem of improving cache predictability and performance in embedded systems through...
We present a novel, compile-time method for determining the cache performance of the loop nests in a...
Cache memories in embedded systems play an important role in reducing the execution time of the appl...
grantor: University of TorontoThe latency of accessing instructions and data from the memo...
Limited set-associativity in hardware caches can cause conflict misses when multiple data items map ...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
The technological improvements in silicon manufacturing are yielding vast increases of processor &ap...
The performance of a traditional cache memory hierarchy can be improved by utilizing mechanisms such...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
The widening gap between processor and memory speeds renders data locality optimization a very impor...
Limited set-associativity in hardware caches can cause conflict misses when multiple data items map ...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
Cache performance is critical in cache-based supercomputers, where the cache-miss/cache-hit memory r...
Abstract — Data cache in embedded systems plays the roles of both speeding up program execution and ...
We address the problem of improving cache predictability and performance in embedded systems through...
We present a novel, compile-time method for determining the cache performance of the loop nests in a...
Cache memories in embedded systems play an important role in reducing the execution time of the appl...
grantor: University of TorontoThe latency of accessing instructions and data from the memo...