Non-negative spectral factorisation has been used successfully for separation of speech and noise in automatic speech recognition, both in feature-enhancing front-ends and in direct classification. In this work, we propose employing spectro-temporal 2D filters to model dynamic properties of Mel-scale spectrogram patterns in ad-dition to static magnitude features. The results are evaluated using an exemplar-based sparse classifier on the CHiME noisy speech database. After optimisation of static features and modelling of tem-poral dynamics with derivative features, we achieve 87.4 % average score over SNRs from 9 to-6 dB, reducing the word error rate by 28.1 % from our previous static-only features. Index Terms — Automatic speech recognition,...
In this work, a first approach to a robust phoneme recognition task by means of a biologically inspi...
Recent results from physiological and psychoacoustic studies indicate that spectrally and temporally...
© 2014 IEEE. Performing automatic speech recognition using exemplars (templates) holds the promise t...
Speech recognition systems intended for everyday use must be able to cope with a large variety of no...
Although noise robust automatic speech recognition (ASR) has been a topic of intensive research, to ...
Communication by speech is intrinsic for humans. Since the breakthrough of mobile devices and wirele...
Studies from multiple disciplines show that spectro-temporal units of natural languages and human sp...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
This work investigates the application of spectral and temporal speech processing algorithms develop...
Previously we presented an auditory-inspired feed-forward architecture which achieves good performan...
The speech signal is inherently characterized by its variations in time, which get reflected as vari...
Kovács G., ''Noise robust automatic speech recognition based on spectro-temporal techniques'', Proef...
In this paper, we propose a framework for joint normalization of spectral and temporal statistics of...
© 2014 IEEE. We propose a novel exemplar-based feature enhancement method for automatic speech recog...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
In this work, a first approach to a robust phoneme recognition task by means of a biologically inspi...
Recent results from physiological and psychoacoustic studies indicate that spectrally and temporally...
© 2014 IEEE. Performing automatic speech recognition using exemplars (templates) holds the promise t...
Speech recognition systems intended for everyday use must be able to cope with a large variety of no...
Although noise robust automatic speech recognition (ASR) has been a topic of intensive research, to ...
Communication by speech is intrinsic for humans. Since the breakthrough of mobile devices and wirele...
Studies from multiple disciplines show that spectro-temporal units of natural languages and human sp...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
This work investigates the application of spectral and temporal speech processing algorithms develop...
Previously we presented an auditory-inspired feed-forward architecture which achieves good performan...
The speech signal is inherently characterized by its variations in time, which get reflected as vari...
Kovács G., ''Noise robust automatic speech recognition based on spectro-temporal techniques'', Proef...
In this paper, we propose a framework for joint normalization of spectral and temporal statistics of...
© 2014 IEEE. We propose a novel exemplar-based feature enhancement method for automatic speech recog...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
In this work, a first approach to a robust phoneme recognition task by means of a biologically inspi...
Recent results from physiological and psychoacoustic studies indicate that spectrally and temporally...
© 2014 IEEE. Performing automatic speech recognition using exemplars (templates) holds the promise t...