The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to video and image compression. We show that this transform, by means of the lifting scheme, can be performed in a memory and computation-efficient way on modern, programmable GPUs, which can be regarded as massively parallel coprocessors through NVidia's CUDA compute paradigm. The three main hardware architectures for the 2D DWT (row-column, line-based, block-based) are shown to be unsuitable for a CUDA implementation. Our CUDA-specific design can be regarded as a hybrid method between the row-column and block-based methods. We achieve considerable speedups compared to an optimized CPU implementation and earlier non-CUDA-based GPU DWT methods, both...
The 2-D discrete wavelet transform (DWT) can be found in the heart of many image-processing algorith...
In this brief, we propose a new parallel lifting-based 2-D DWT architecture with high memory efficie...
Abstract—The release of the CUDA Kepler architecture in March 2012 has provided Nvidia GPUs with a l...
The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to vide...
The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to vide...
The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to vide...
The DiscreteWavelet Transform (DWT) has a wide range of applications from signal processing to video...
In order to accelerate the Discrete Wavelet Transform DWT, we have implemented and compared the lift...
AbstractGPUs have recently attracted our attention as accelerators on a wide variety of algorithms, ...
Discrete wavelet transform (DWT) has been widely used in many image compression applications, such a...
The Discrete Haar Wavelet Transform has a wide range of applications from signal processing to video...
The release of the CUDA Kepler architecture in March 2012 has provided Nvidia GPUs with a larger reg...
The 2-D discrete wavelet transform (DWT) can be found in the heart of many image-processing algorith...
In this brief, we propose a new parallel lifting-based 2-D DWT architecture with high memory efficie...
Abstract—The release of the CUDA Kepler architecture in March 2012 has provided Nvidia GPUs with a l...
The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to vide...
The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to vide...
The Discrete Wavelet Transform (DWT) has a wide range of applications from signal processing to vide...
The DiscreteWavelet Transform (DWT) has a wide range of applications from signal processing to video...
In order to accelerate the Discrete Wavelet Transform DWT, we have implemented and compared the lift...
AbstractGPUs have recently attracted our attention as accelerators on a wide variety of algorithms, ...
Discrete wavelet transform (DWT) has been widely used in many image compression applications, such a...
The Discrete Haar Wavelet Transform has a wide range of applications from signal processing to video...
The release of the CUDA Kepler architecture in March 2012 has provided Nvidia GPUs with a larger reg...
The 2-D discrete wavelet transform (DWT) can be found in the heart of many image-processing algorith...
In this brief, we propose a new parallel lifting-based 2-D DWT architecture with high memory efficie...
Abstract—The release of the CUDA Kepler architecture in March 2012 has provided Nvidia GPUs with a l...