Distant conversational speech recognition is challenging ow-ing to the presence of multiple, overlapping talkers, additional non-speech acoustic sources, and the effects of reverberation. In this paper we review work on distant speech recognition, with an emphasis on approaches which combine multichan-nel signal processing with acoustic modelling, and investi-gate the use of hybrid neural network / hidden Markov model acoustic models for distant speech recognition of meetings recorded using microphone arrays. In particular we investi-gate the use of convolutional and fully-connected neural net-works with different activation functions (sigmoid, rectified linear, and maxout). We performed experiments on the AMI and ICSI meeting corpora, with...
This paper introduces a novel insight to the problem of Automatic Speech Recognition (ASR). Worldwid...
A challenging scenario is addressed in which a distant-talking speech recognizer operates in a noisy...
Fink GA, Hohenner S. Experiments in Distant Talking Speech Recognition Using a Standard Database. In...
Deep learning is an emerging technology that is considered one of the most promising directions for ...
Despite the remarkable progress recently made in distant speech recognition, state-of-the-art techno...
In many applications, speech recognition must operate in conditions where there are some distances b...
In this paper, we describe our efforts to develop acoustic models and decoding setups suitable for a...
Acoustic modeling based on deep architectures has recently gained remarkable success, with substanti...
The problem of room localization is to determine where, in a multi-room environment, a person is pro...
Recently, the convolutional neural network (CNN) with multiple microphones was proposed to use the d...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
This paper presents an investigation of far field speech recog-nition using beamforming and channel ...
AbstractThis paper introduces a novel insight to the problem of Automatic Speech Recognition (ASR). ...
In an effort to advance the state of the art in continuous peech recognition employing hidden Markov...
We propose robust distant speech recognition by combining multiple microphone-array processing with...
This paper introduces a novel insight to the problem of Automatic Speech Recognition (ASR). Worldwid...
A challenging scenario is addressed in which a distant-talking speech recognizer operates in a noisy...
Fink GA, Hohenner S. Experiments in Distant Talking Speech Recognition Using a Standard Database. In...
Deep learning is an emerging technology that is considered one of the most promising directions for ...
Despite the remarkable progress recently made in distant speech recognition, state-of-the-art techno...
In many applications, speech recognition must operate in conditions where there are some distances b...
In this paper, we describe our efforts to develop acoustic models and decoding setups suitable for a...
Acoustic modeling based on deep architectures has recently gained remarkable success, with substanti...
The problem of room localization is to determine where, in a multi-room environment, a person is pro...
Recently, the convolutional neural network (CNN) with multiple microphones was proposed to use the d...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
This paper presents an investigation of far field speech recog-nition using beamforming and channel ...
AbstractThis paper introduces a novel insight to the problem of Automatic Speech Recognition (ASR). ...
In an effort to advance the state of the art in continuous peech recognition employing hidden Markov...
We propose robust distant speech recognition by combining multiple microphone-array processing with...
This paper introduces a novel insight to the problem of Automatic Speech Recognition (ASR). Worldwid...
A challenging scenario is addressed in which a distant-talking speech recognizer operates in a noisy...
Fink GA, Hohenner S. Experiments in Distant Talking Speech Recognition Using a Standard Database. In...