International audienceMulti-microphone signal processing techniques have the potential to greatly improve the robustness of speech recognition (ASR) in distant microphone settings. However, in everyday environments, typified by complex non-stationary noise backgrounds, designing effective multi-microphone speech recognition systems is non trivial. In particular, optimal performance requires the tight integration of the front-end signal processing and the back-end statistical speech and noise source modelling. The best way to achieve this in a modern deep learning speech recognition framework remains unclear. Further, variability in microphone array design --- and consequent lack of real training data for any particular configuration --- may...
Multichannel fusion strategies are presented for the distributed microphone recognition environment,...
Speech enhancement and automatic speech recognition (ASR) are most often evaluated in matched (or mu...
Robust speech recognition in acoustic environments that contain multiple speech sources and/or compl...
International audienceMulti-microphone signal processing techniques have the potential to greatly im...
When speech is captured with a distant microphone, it includes distortions caused by noise, reverber...
Automatic speech recognition in a room with distant microphones is strongly affected by noise and re...
The availability of realistic simulated corpora is of key importance for the future progress of dist...
Interest within the automatic speech recognition (ASR) research community has recently focused on th...
International audienceMy talk will focus on robustness to background noise in distant-microphone spe...
Speech recognition performance degrades significantly in distant-talking environments, where the sp...
Microphone arrays can be advantageously employed in Automatic Speech Recognition (ASR) systems to al...
Despite sophisticated present day automatic speech recognition (ASR) techniques, a single recognizer...
Distant-microphone automatic speech recognition (ASR) re-mains a challenging goal in everyday enviro...
In this paper we present a new method of signal processing for robust speech recognition using multi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Multichannel fusion strategies are presented for the distributed microphone recognition environment,...
Speech enhancement and automatic speech recognition (ASR) are most often evaluated in matched (or mu...
Robust speech recognition in acoustic environments that contain multiple speech sources and/or compl...
International audienceMulti-microphone signal processing techniques have the potential to greatly im...
When speech is captured with a distant microphone, it includes distortions caused by noise, reverber...
Automatic speech recognition in a room with distant microphones is strongly affected by noise and re...
The availability of realistic simulated corpora is of key importance for the future progress of dist...
Interest within the automatic speech recognition (ASR) research community has recently focused on th...
International audienceMy talk will focus on robustness to background noise in distant-microphone spe...
Speech recognition performance degrades significantly in distant-talking environments, where the sp...
Microphone arrays can be advantageously employed in Automatic Speech Recognition (ASR) systems to al...
Despite sophisticated present day automatic speech recognition (ASR) techniques, a single recognizer...
Distant-microphone automatic speech recognition (ASR) re-mains a challenging goal in everyday enviro...
In this paper we present a new method of signal processing for robust speech recognition using multi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Multichannel fusion strategies are presented for the distributed microphone recognition environment,...
Speech enhancement and automatic speech recognition (ASR) are most often evaluated in matched (or mu...
Robust speech recognition in acoustic environments that contain multiple speech sources and/or compl...