Permuting a vector is a fundamental primitive which arises in many applications. In particular, rational permutations, which are defined by permutations of the bits of the binary representations of the vector indices, are widely used. Matrix transposition and bit-reversal are notable examples of rational permutations. In this paper we contribute a number of results regarding the execution of these permutations in cache hierarchies, with particular emphasis on the cache-oblivious setting. We first bound from below the work needed to execute a rational permutation with an optimal cache complexity. Then, we develop a cache-oblivious algorithm to perform any rational permutation, which exhibits optimal work and cache complexities under the tall...
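To make the definition of a rational permutation concrete, here is a minimal Python sketch. It is a naive, index-by-index illustration and not the cache-oblivious algorithm the paper develops. The names `rational_permutation`, `bit_reversal_sigma`, and `transpose_sigma` are hypothetical. The convention that `sigma[j]` gives the destination position of source bit `j` (with bit 0 least significant) is an assumption made for this example, as is the row-major matrix layout.

```python
def rational_permutation(vec, sigma):
    """Permute vec (length 2**n) according to the bit permutation sigma.

    Assumed convention: sigma[j] is the position that bit j of the source
    index occupies in the destination index (bit 0 = least significant).
    """
    n = len(sigma)
    if len(vec) != 1 << n:
        raise ValueError("vector length must be 2**len(sigma)")
    if sorted(sigma) != list(range(n)):
        raise ValueError("sigma must be a permutation of 0..n-1")

    out = [None] * len(vec)
    for i, value in enumerate(vec):
        dest = 0
        for j in range(n):
            if (i >> j) & 1:           # bit j of the source index is set
                dest |= 1 << sigma[j]  # so set bit sigma[j] of the destination
        out[dest] = value
    return out


def bit_reversal_sigma(n):
    """Bit reversal: bit j moves to position n - 1 - j."""
    return [n - 1 - j for j in range(n)]


def transpose_sigma(row_bits, col_bits):
    """Transposition of a row-major 2**row_bits x 2**col_bits matrix.

    The source index is r * 2**col_bits + c and the destination index is
    c * 2**row_bits + r, which is a cyclic shift of the index bits.
    """
    n = row_bits + col_bits
    return [(j + row_bits) % n for j in range(n)]


if __name__ == "__main__":
    v = list(range(8))
    print(rational_permutation(v, bit_reversal_sigma(3)))
    print(rational_permutation(v, transpose_sigma(1, 2)))
```

Both notable examples from the abstract reduce to a choice of `sigma`: bit reversal reverses the order of the index bits, and transposition swaps the block of row bits with the block of column bits.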
One of the keys to tap the full performance potential of current hardware is the optimal uti...
In this work, we study the cache-oblivious computation model, which is inspired by the behaviour of ...
Sandeep Sen, Siddhartha Chatterjee. Submitted for publication. Abstract: We describe a model...
Intuitively, a cache-oblivious algorithm implements an adaptive strategy which runs efficie...
The hierarchical organization of the memory and communication systems and the availability of numero...
This paper presents asymptotically optimal algorithms for rectangular matrix transpose, FF...
Cache-oblivious algorithms have been advanced as a way of circumventing some of the difficulties of ...
In this thesis we introduce a cost measure to compare the cache-friendliness of different permutati...
We introduce the cache-adaptive model, which generalizes the external-memory model to apply to envir...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
We present a model that enables us to analyze the running time of an algorithm on a computer with a ...
Cache-oblivious algorithms are well understood when the cache size remains constant. Recently variab...
Cache-obliviousness is a concept first introduced by Frigo et al. in [1]. We follow their model and ...
In this paper we explore a simple and general approach for developing parallel algorithms that lead ...