As memory access times grow larger relative to processor cycle times, the cache performance of algorithms has an increasingly large impact on overall performance. Unfortunately, most commonly used algorithms were not designed with cache performance in mind. This paper investigates the cache performance of implicit heaps. We present optimizations which significantly reduce the cache misses that heaps incur and improve their overall performance. We present an analytical model called collective analysis that allows cache performance to be predicted as a function of both cache configuration and algorithm configuration. As part of our investigation, we perform an approximate analysis of the cache performance of both traditional heaps and our imp...
Because of the infeasibility or expense of large fully-associative caches, cache memories are often ...
A new performance model of the memory hierarchy is first introduced, which describes all possible sc...
The research software URDME makes use of a priority queue that has support for updating the priority...
The theory community has proposed several new heap variants in the recent past which have remained l...
Abstract We investigate the effect that caches have on the performance of sorting algorithms both ex...
We investigate the effect that caches have on the performance of sorting algorithms both experimenta...
We present a model that enables us to analyze the running time of an algorithm on a computer with a ...
) Sandeep Sen y Siddhartha Chatterjee z Submitted for publication Abstract We describe a model...
In this report, we consider the design of cache memories to support the execution of languages that ...
This paper describes a model for studying the cache performance of algorithms in a direct-mapped cac...
This paper describes a model for studying the cache performance of algorithms in a direct-mapped cac...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
We describe a model that enables us to analyze the running time of an algorithm in a computer with a...
Cache behavior is complex and inherently unstable, yet it is a critical factor affecting program per...
. To deepen our quantitative understanding of the performance of lazy evaluation, we have studied th...
Because of the infeasibility or expense of large fully-associative caches, cache memories are often ...
A new performance model of the memory hierarchy is first introduced, which describes all possible sc...
The research software URDME makes use of a priority queue that has support for updating the priority...
The theory community has proposed several new heap variants in the recent past which have remained l...
Abstract We investigate the effect that caches have on the performance of sorting algorithms both ex...
We investigate the effect that caches have on the performance of sorting algorithms both experimenta...
We present a model that enables us to analyze the running time of an algorithm on a computer with a ...
) Sandeep Sen y Siddhartha Chatterjee z Submitted for publication Abstract We describe a model...
In this report, we consider the design of cache memories to support the execution of languages that ...
This paper describes a model for studying the cache performance of algorithms in a direct-mapped cac...
This paper describes a model for studying the cache performance of algorithms in a direct-mapped cac...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
We describe a model that enables us to analyze the running time of an algorithm in a computer with a...
Cache behavior is complex and inherently unstable, yet it is a critical factor affecting program per...
. To deepen our quantitative understanding of the performance of lazy evaluation, we have studied th...
Because of the infeasibility or expense of large fully-associative caches, cache memories are often ...
A new performance model of the memory hierarchy is first introduced, which describes all possible sc...
The research software URDME makes use of a priority queue that has support for updating the priority...