In this paper we present a highly optimized implementation of Gaussian mixture acoustic model evaluation algorithm. Eval-uation of these likelihoods is one of the most computationally intensive parts of automatics speech recognizers but it can be well-parallelized and offloaded to GPU devices. Our approach offers significant speed-up compared to the recently published approaches, since it exploits the GPU architecture better. All the recent implementations were programmed either in CUDA or OpenCL GPU programming frameworks. We present results for both; CUDA as well as OpenCL. Results suggest that even very large acoustic models can be utilized in real-time speech recognition engines on com-puters and laptops equipped with a low-end GPU. Opt...
LVCSR systems are usually based on continuous density HMMs, which are typically implemented using Ga...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
In this paper we present a highly optimized implementation of Gaussian mixture acoustic model evalu...
In this paper, we describe an optimized version of a Gaussian-mixture-based acoustic model likelihoo...
Automatic speech recognition (ASR) is a very demanding computing task. Much research has been done i...
Abstract — Gaussian mixture models (GMMs) are often used in various data processing and classificati...
This paper introduces the use of Graphics Processors Unit (GPU) for computing acoustic likelihoods i...
Gaussian Mixture Model (GMM) computations in modern Automatic Speech Recognition systems are known t...
Gaussian Mixture Model (GMM) statistics are required for maximum likelihood training as well as for ...
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
This master thesis characterizes the performance and energy bottlenecks of speech recognition system...
Sound source localization is an important topic in expert systems involving microphone arrays, such ...
This paper describes the effort with building speaker-clustered acoustic models as a part of the rea...
MSc (Computer Science), North-West University, Mafikeng Campus, 2014In a typical recognition process...
LVCSR systems are usually based on continuous density HMMs, which are typically implemented using Ga...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...
In this paper we present a highly optimized implementation of Gaussian mixture acoustic model evalu...
In this paper, we describe an optimized version of a Gaussian-mixture-based acoustic model likelihoo...
Automatic speech recognition (ASR) is a very demanding computing task. Much research has been done i...
Abstract — Gaussian mixture models (GMMs) are often used in various data processing and classificati...
This paper introduces the use of Graphics Processors Unit (GPU) for computing acoustic likelihoods i...
Gaussian Mixture Model (GMM) computations in modern Automatic Speech Recognition systems are known t...
Gaussian Mixture Model (GMM) statistics are required for maximum likelihood training as well as for ...
Accurate, real-time Automatic Speech Recognition (ASR) comes at a high energy cost, so accuracy has ...
This master thesis characterizes the performance and energy bottlenecks of speech recognition system...
Sound source localization is an important topic in expert systems involving microphone arrays, such ...
This paper describes the effort with building speaker-clustered acoustic models as a part of the rea...
MSc (Computer Science), North-West University, Mafikeng Campus, 2014In a typical recognition process...
LVCSR systems are usually based on continuous density HMMs, which are typically implemented using Ga...
In an automatic speech recognition system us-ing a tied-mixture acoustic model, the main cost in CPU...
EUROSPEECH2003: 8th European Conference on Speech Communication and Technology, September 1-4, 2003...