N-dimensional transpose/permutation is a very important operation in many large-scale data intensive and scientific applications. These applications include but not limited to oil industry i.e. seismic data processing, nuclear medicine, media production, digital signal processing and business intelligence. This paper proposes an efficient in-place N-dimensional permutation algorithm. The algorithm is based on a novel 3D transpose algorithm that was published recently. The proposed algorithm has been tested on 3D, 4D, 5D, 6D and 7D data sets as a proof of concept. This is the first contribution which is breaking the dimensions’ limitation of the base algorithm. The suggested algorithm exploits the idea of mixing both logical and physical per...
This paper studies the CUDA programming challenges with using multiple GPUs inside a single machine ...
Many simulations in the physical sciences are expressed in terms of rectilinear arrays of variables....
Optimal usage of the memory system is a key element of fast GPU algorithms. Unfortunately many commo...
AbstractN-dimensional transpose/permutation is a very important operation in many large-scale data i...
In this paper we present a data parallel volume rendering algorithm with numerous advantages over pr...
Matrix transposition is an important algorithmic building block for many numeric algorithms like m...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
volume visualization, parallel algorithms © Copyright Hewlett-Packard Company 1998 Biomedical volume...
[[abstract]]The authors present a parallel algorithm to generate the permutations of at most k out o...
Transposing an N × N array that is distributed row- or column-wise across P = N processors is a fund...
International audienceThis work presents a GPU-based backtracking algorithm for permutation combinat...
International audienceWe tackle the feasibility and efficiency of two new parallel algorithms that s...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
In the medical field, volume rendering provides good quality 3D visualizations but is still not enou...
The permutation generation method is based on starter sets generation under exchange operation and e...
This paper studies the CUDA programming challenges with using multiple GPUs inside a single machine ...
Many simulations in the physical sciences are expressed in terms of rectilinear arrays of variables....
Optimal usage of the memory system is a key element of fast GPU algorithms. Unfortunately many commo...
AbstractN-dimensional transpose/permutation is a very important operation in many large-scale data i...
In this paper we present a data parallel volume rendering algorithm with numerous advantages over pr...
Matrix transposition is an important algorithmic building block for many numeric algorithms like m...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
volume visualization, parallel algorithms © Copyright Hewlett-Packard Company 1998 Biomedical volume...
[[abstract]]The authors present a parallel algorithm to generate the permutations of at most k out o...
Transposing an N × N array that is distributed row- or column-wise across P = N processors is a fund...
International audienceThis work presents a GPU-based backtracking algorithm for permutation combinat...
International audienceWe tackle the feasibility and efficiency of two new parallel algorithms that s...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
In the medical field, volume rendering provides good quality 3D visualizations but is still not enou...
The permutation generation method is based on starter sets generation under exchange operation and e...
This paper studies the CUDA programming challenges with using multiple GPUs inside a single machine ...
Many simulations in the physical sciences are expressed in terms of rectilinear arrays of variables....
Optimal usage of the memory system is a key element of fast GPU algorithms. Unfortunately many commo...