This article investigates the recursive Morton ordering of two-dimensional arrays as an efficient way to access hierarchical memory across a range of heterogeneous computer platforms, ranging from manycore devices, multicore processors, clusters and distributed environments. A brief overview of previous research in this area is given, and algorithms that make use of Morton ordering are described. These are then used to investigate the efficiency of the Morton ordering approach by performance experiments on different processors. In particular, timing results are presented for matrix multiplication, Cholesky factorization and fast Fourier transform algorithms. The use of the Morton ordering approach leads naturally to algorithms that are recu...
The importance of tiles or blocks in mathematics and thus computer science cannot be overstated. Fro...
Abstract. Morton layout is a compromise storage layout between the programming language mandated lay...
The optimal implementation of matrix multiplication on modern computer architectures is of great imp...
This article investigates the recursive Morton ordering of two-dimensional arrays as an efficient wa...
A proof of concept is offered for the uniform representation of matrices serially in Morton-order (o...
Strassen's algorithm for matrix multiplication gains its lower arithmetic complexityatthe expe...
Programming languages that provide multidimensional arrays and a flat linear model of memory must im...
Hierarchically-blocked non-linear storage layouts, such as the Morton ordering, have been proposed a...
Submitted for publication to IEEE TPDS The performance of both serial and parallel implementations o...
Abstract. We present a recursive way to partition hypergraphs which creates and exploits hypergraph ...
In this paper we introduce parallel versions of two hierarchical memory models and give optimal algo...
Abstract. A style for programming problems from matrix algebra is developed with a familiar example ...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...
Hierarchical matrix (H-matrix) techniques can be used to efficiently treat dense matrices. With an H...
Hierarchically-blocked non-linear storage layouts, such as the Morton ordering, have been shown to b...
The importance of tiles or blocks in mathematics and thus computer science cannot be overstated. Fro...
Abstract. Morton layout is a compromise storage layout between the programming language mandated lay...
The optimal implementation of matrix multiplication on modern computer architectures is of great imp...
This article investigates the recursive Morton ordering of two-dimensional arrays as an efficient wa...
A proof of concept is offered for the uniform representation of matrices serially in Morton-order (o...
Strassen's algorithm for matrix multiplication gains its lower arithmetic complexityatthe expe...
Programming languages that provide multidimensional arrays and a flat linear model of memory must im...
Hierarchically-blocked non-linear storage layouts, such as the Morton ordering, have been proposed a...
Submitted for publication to IEEE TPDS The performance of both serial and parallel implementations o...
Abstract. We present a recursive way to partition hypergraphs which creates and exploits hypergraph ...
In this paper we introduce parallel versions of two hierarchical memory models and give optimal algo...
Abstract. A style for programming problems from matrix algebra is developed with a familiar example ...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...
Hierarchical matrix (H-matrix) techniques can be used to efficiently treat dense matrices. With an H...
Hierarchically-blocked non-linear storage layouts, such as the Morton ordering, have been shown to b...
The importance of tiles or blocks in mathematics and thus computer science cannot be overstated. Fro...
Abstract. Morton layout is a compromise storage layout between the programming language mandated lay...
The optimal implementation of matrix multiplication on modern computer architectures is of great imp...