AbstractPermuting a vector is a fundamental primitive which arises in many applications. In particular, rational permutations, which are defined by permutations of the bits of the binary representations of the vector indices, are widely used. Matrix transposition and bit-reversal are notable examples of rational permutations. In this paper we contribute a number of results regarding the execution of these permutations in cache hierarchies, with particular emphasis on the cache-oblivious setting. We first bound from below the work needed to execute a rational permutation with an optimal cache complexity. Then, we develop a cache-oblivious algorithm to perform any rational permutation, which exhibits optimal work and cache complexities under ...
) Sandeep Sen y Siddhartha Chatterjee z Submitted for publication Abstract We describe a model...
Cache-oblivious algorithms are well understood when the cache size remains constant. Recently variab...
In this thesis we introduce a cost measure to compare the cache- friendliness of different permutati...
AbstractPermuting a vector is a fundamental primitive which arises in many applications. In particul...
Permuting a vector is a fundamental primitive which arises in many applications. In particular, rati...
Abstract Intuitively, a cache-oblivious algorithm implements an adaptive strategy which runs efficie...
The hierarchical organization of the memory and communication systems and the availability of numero...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Cache-oblivious algorithms have been advanced as a way of circumventing some of the difficulties of ...
Abstract This paper presents asymptotically optimal algo-rithms for rectangular matrix transpose, FF...
We introduce the cache-adaptive model, which generalizes the external-memory model to apply to envir...
We describe a model that enables us to analyze the running time of an algorithm in a computer with a...
The VAT-model (virtual address translation model) extends the EM-model (external memory model) and t...
Memory efficiency and locality have substantial impact on the performance of programs, particularly ...
Cache-oblivious algorithms are designed to be executed on the ideal-cache machine, an abstract machi...
) Sandeep Sen y Siddhartha Chatterjee z Submitted for publication Abstract We describe a model...
Cache-oblivious algorithms are well understood when the cache size remains constant. Recently variab...
In this thesis we introduce a cost measure to compare the cache- friendliness of different permutati...
AbstractPermuting a vector is a fundamental primitive which arises in many applications. In particul...
Permuting a vector is a fundamental primitive which arises in many applications. In particular, rati...
Abstract Intuitively, a cache-oblivious algorithm implements an adaptive strategy which runs efficie...
The hierarchical organization of the memory and communication systems and the availability of numero...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
Cache-oblivious algorithms have been advanced as a way of circumventing some of the difficulties of ...
Abstract This paper presents asymptotically optimal algo-rithms for rectangular matrix transpose, FF...
We introduce the cache-adaptive model, which generalizes the external-memory model to apply to envir...
We describe a model that enables us to analyze the running time of an algorithm in a computer with a...
The VAT-model (virtual address translation model) extends the EM-model (external memory model) and t...
Memory efficiency and locality have substantial impact on the performance of programs, particularly ...
Cache-oblivious algorithms are designed to be executed on the ideal-cache machine, an abstract machi...
) Sandeep Sen y Siddhartha Chatterjee z Submitted for publication Abstract We describe a model...
Cache-oblivious algorithms are well understood when the cache size remains constant. Recently variab...
In this thesis we introduce a cost measure to compare the cache- friendliness of different permutati...