Abstract Intuitively, a cache-oblivious algorithm implements an adaptive strategy which runs efficiently on any memory hierarchy without requiring previous knowledge of the parameters of the hierarchy. For this reason, cache-obliviousness is an attractive feature of an algorithm meant for a global computing environment, where software may be run on a variety of different platforms for load management purposes. In this paper we present a negative result on cache-obliviousness, namely, we show that an optimal cache-oblivious algorithm for the fundamental primitive of matrix transposition cannot exist without the tall cache assumption, which forces the (unknown) parameters of the memory hierarchy to satisfy a certain technical relation. Our co...
Cache-oblivious algorithms are well understood when the cache size remains constant. Recently variab...
Matrix transposition is a fundamental operation, but it may present a very low and hardly predictabl...
Cache-oblivious algorithms are designed to be executed on the ideal-cache machine, an abstract machi...
Permuting a vector is a fundamental primitive which arises in many applications. In particular, rati...
AbstractPermuting a vector is a fundamental primitive which arises in many applications. In particul...
Cache-oblivious algorithms have been advanced as a way of circumventing some of the difficulties of ...
Abstract This paper presents asymptotically optimal algo-rithms for rectangular matrix transpose, FF...
We introduce the cache-adaptive model, which generalizes the external-memory model to apply to envir...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
The hierarchical organization of the memory and communication systems and the availability of numero...
This report deals with the ecient calculation of matrix-matrix multiplication, without using explici...
AbstractOne of the keys to tap the full performance potential of current hardware is the optimal uti...
Memory efficiency and locality have substantial impact on the performance of programs, particularly ...
In this work, we study the cache-oblivious computation model, which is inspired by the behaviour of ...
We present a model that enables us to analyze the running time of an algorithm on a computer with a ...
Cache-oblivious algorithms are well understood when the cache size remains constant. Recently variab...
Matrix transposition is a fundamental operation, but it may present a very low and hardly predictabl...
Cache-oblivious algorithms are designed to be executed on the ideal-cache machine, an abstract machi...
Permuting a vector is a fundamental primitive which arises in many applications. In particular, rati...
AbstractPermuting a vector is a fundamental primitive which arises in many applications. In particul...
Cache-oblivious algorithms have been advanced as a way of circumventing some of the difficulties of ...
Abstract This paper presents asymptotically optimal algo-rithms for rectangular matrix transpose, FF...
We introduce the cache-adaptive model, which generalizes the external-memory model to apply to envir...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
The hierarchical organization of the memory and communication systems and the availability of numero...
This report deals with the ecient calculation of matrix-matrix multiplication, without using explici...
AbstractOne of the keys to tap the full performance potential of current hardware is the optimal uti...
Memory efficiency and locality have substantial impact on the performance of programs, particularly ...
In this work, we study the cache-oblivious computation model, which is inspired by the behaviour of ...
We present a model that enables us to analyze the running time of an algorithm on a computer with a ...
Cache-oblivious algorithms are well understood when the cache size remains constant. Recently variab...
Matrix transposition is a fundamental operation, but it may present a very low and hardly predictabl...
Cache-oblivious algorithms are designed to be executed on the ideal-cache machine, an abstract machi...