In this paper we present randomized algorithms for sorting and convex hull that achieve optimal performance (for speed-up and cache misses) on the multicore model with private caches. Our algorithms are cache oblivious and generalize the randomized divide-and-conquer strategy given by Reischuk [14] and Reif and Sen [17]. Although the approach yielded optimal speed-up in the PRAM model, we require additional techniques to optimize cache misses in an oblivious setting. Let p, n, M, and B denote the number of processors, the problem size, the size of an individual processor's cache memory, and the block size, respectively. We then obtain expected parallel running time O((n/p) log n + log n log log n) with expected O((n/B) log_M n) cache misses for sortin...
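To make the randomized divide-and-conquer idea concrete, the following is a minimal sequential sketch of sorting by random sampling. It is not the paper's algorithm. The function name `sample_sort`, the `base_size` cutoff, and the choice of about sqrt(n) splitters are assumptions made for illustration, and the sketch models neither the p processors nor cache behavior.

```python
import bisect
import random


def sample_sort(items, rng=None, base_size=32):
    """Sort a list by randomized divide and conquer (illustrative sketch).

    Hypothetical helper: the name, the sqrt(n) sample size, and the
    base_size cutoff are assumptions, not taken from the paper.
    """
    rng = rng or random.Random()
    n = len(items)
    # Small inputs are sorted directly; the floor of 4 keeps the sample valid.
    if n <= max(base_size, 4):
        return sorted(items)

    # Draw about sqrt(n) random elements and keep their distinct values
    # as splitters.
    k = max(2, int(n ** 0.5))
    splitters = sorted(set(rng.sample(items, k)))

    # buckets[i] holds keys strictly between splitter i-1 and splitter i;
    # keys equal to splitter i are set aside in equal[i].
    buckets = [[] for _ in range(len(splitters) + 1)]
    equal = [[] for _ in range(len(splitters))]
    for x in items:
        i = bisect.bisect_left(splitters, x)
        if i < len(splitters) and splitters[i] == x:
            equal[i].append(x)
        else:
            buckets[i].append(x)

    # Each bucket is strictly smaller than the input, so the recursion ends.
    # In a parallel setting the buckets could be sorted independently.
    out = []
    for i in range(len(splitters)):
        out.extend(sample_sort(buckets[i], rng, base_size))
        out.extend(equal[i])
    out.extend(sample_sort(buckets[-1], rng, base_size))
    return out
```

Calling `sample_sort(data)` on a list returns a new sorted list. Because the buckets are disjoint key ranges, they are the natural units to hand to separate processors, which is the property the parallel versions of this strategy rely on.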
Cache-oblivious algorithms have been advanced as a way of circumventing some of the difficulties of ...
We describe the first optimal randomized in-place algorithm for the basic 3-d convex hull pr...
We study randomized techniques for designing efficient algorithms on a p-processor bulk-synchronous ...
In this paper we explore a simple and general approach for developing parallel algorithms that lead ...
We address the design of algorithms for multicores that are oblivious to machine parameters. We prop...
We address the design of algorithms for multicores that are oblivious to machine parameters. We prop...
This paper presents asymptotically optimal algorithms for rectangular matrix transpose, FF...
Abstract We adapt the distribution sweeping method to the cache oblivious model. Distribution sweepi...
We adapt the distribution sweeping method to the cache oblivious model. Distribution sweeping is the...
In this work, we study the cache-oblivious computation model, which is inspired by the behaviour of ...
The computation of a peeling order in a randomly generated hypergraph is the most time-consuming ste...
Rezaul Alam Chowdhury of Boston University presented a lecture on March 28, 2011 from 10:00 am to 11...
The computation of a peeling order in a randomly generated hypergraph is the most time-consuming ste...
We introduce the cache-adaptive model, which generalizes the external-memory model to apply to envir...
We address the tradeoff between the competitive ratio and the resources used by randomized o...