This paper presents a 3GPP LTE-compliant turbo decoder accelerator on a GPU. The challenge in implementing a turbo decoder on a GPU is finding an efficient mapping of the decoding algorithm onto the hardware, e.g. parallelizing the workload across cores and allocating and using fast on-die memory to improve throughput. In our implementation, we increase throughput by 1) distributing the decoding workload for a codeword across multiple cores, 2) decoding multiple codewords simultaneously to increase concurrency, and 3) employing memory optimization techniques to reduce memory bandwidth requirements. In addition, we analyze how different MAP algorithm approximations affect both the throughput and the bit error rate (BER) performance of this decoder.
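The abstract does not say which MAP approximations are compared. As an illustrative sketch only, the Python snippet below contrasts two variants that are standard in the turbo decoding literature: the exact Jacobian logarithm used by log-MAP and the max-log-MAP simplification that drops the correction term. The function names `max_star_exact` and `max_star_approx` are hypothetical and are not taken from the paper.

```python
import math


def max_star_exact(a: float, b: float) -> float:
    """Jacobian logarithm log(exp(a) + exp(b)), as used by log-MAP.

    Written as max(a, b) plus a correction term so it stays numerically stable.
    """
    return max(a, b) + math.log1p(math.exp(-abs(a - b)))


def max_star_approx(a: float, b: float) -> float:
    """Max-log-MAP approximation: keep the max, drop the correction term."""
    return max(a, b)


if __name__ == "__main__":
    # Illustrative inputs (assumed values, not data from the paper).
    for a, b in [(0.0, 0.0), (1.0, -1.0), (5.0, -5.0)]:
        exact = max_star_exact(a, b)
        approx = max_star_approx(a, b)
        print(f"a={a:+.1f} b={b:+.1f} exact={exact:.4f} "
              f"approx={approx:.4f} gap={exact - approx:.4f}")
```

The gap between the two is largest when the inputs are equal (ln 2, about 0.693) and shrinks as they move apart. That is the usual trade-off behind such approximations: the approximate form needs only a comparison, at the cost of some accuracy in the computed metrics.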
To meet the high throughput requirement of communication systems, the design of high-throughput low-...
Evolution) wireless communication standard is among the most challenging tasks in terms of computati...
Parallel implementations of Turbo decoding have been studied extensively. Traditionally, the number o...
This paper presents a high-throughput implementation of a portable software tu...
The graphics processor unit (GPU) is able to provide a low-cost and flexible software-based multi-co...
Turbo codes comprising a parallel concatenation of upper and lower convolutional codes are widely em...
In many wireless systems, a Turbo decoder is often combined with a soft-output multiple-input and mu...
Decoding latency of the turbo decoder has been a serious problem in real-time processing of communic...
Low-density parity check (LDPC) codes have been extensively applied in mobile communication systems ...
This paper compares two implementations of reconfigurable and high-throughput turbo decoders. The fi...
The use of many-core processors such as general purpose Graphic Processing Units (GPUs) has recently...
© 2017 IEEE. Implementation of an efficient turbo decoder with low complexity, short delay and insig...
In wireless communication schemes, turbo codes facilitate near-capacity transmission throughputs by ...
During the past several years, there has been a trend that the modern mobile SoC (system-on-chip) chipse...
In order to meet the latency requirements of the Ultra-Reliable Low Latency Communication (URLLC) mo...