Auditory spectro-temporal representations of reverberant speech are investigated for blind estimation of reverber-ation time (RT) and for single-ended measurement of speech quality. The auditory representations are obtained from an eight-filter filterbank which is used to extract the modulation spectra from temporal envelopes of the speech signal. Gaussian mixture models (GMM), one for each modulation channel and trained on clean speech signals, serve as reference models of normative speech behavior. Consistency measures, computed between re-verberant test signals and each GMM, are mapped to an estimated RT and to an estimated quality score. Experi-ments show that the proposed measures achieve superior performance relative to current “state...
This paper proposes two methods for robust automatic speech recognition (ASR) in reverberant environ...
This paper introduces a novel set of non-linear spectro-temporal features that improve automatic spe...
Abstract—Perceptual models exploiting auditory masking are frequently used in audio and speech proce...
A novelmethod for blind estimation of the reverberation time (RT60) is proposed based on applying sp...
Abstract—A modulation spectral representation is investigated for non-intrusive quality and intellig...
A modulation spectral signal representation is investigated for non-intrusive quality measurement of...
A modulation spectral signal representation is investigated for non-intrusive quality measurement of...
Abstract—In this paper, short- and long-term temporal dynamic information is investigated for the bl...
Reverberation time (RT) is an important parameter for room acoustics characterization, intelligibili...
This study focuses on an unexplored aspect of the performance of algorithms for blind reverberation ...
This study focuses on an unexplored aspect of the performance of algorithms for blind reverberation ...
The reverberation time, T60, is one of the key parameters used to quantify room acoustics. It can pr...
The reverberation time, T60, is one of the key parameters used to quantify room acoustics. It can pr...
An improved algorithm for the estimation of the reverberation time (RT) from reverberant speech sign...
The speech signal is inherently characterized by its variations in time, which get reflected as vari...
This paper proposes two methods for robust automatic speech recognition (ASR) in reverberant environ...
This paper introduces a novel set of non-linear spectro-temporal features that improve automatic spe...
Abstract—Perceptual models exploiting auditory masking are frequently used in audio and speech proce...
A novelmethod for blind estimation of the reverberation time (RT60) is proposed based on applying sp...
Abstract—A modulation spectral representation is investigated for non-intrusive quality and intellig...
A modulation spectral signal representation is investigated for non-intrusive quality measurement of...
A modulation spectral signal representation is investigated for non-intrusive quality measurement of...
Abstract—In this paper, short- and long-term temporal dynamic information is investigated for the bl...
Reverberation time (RT) is an important parameter for room acoustics characterization, intelligibili...
This study focuses on an unexplored aspect of the performance of algorithms for blind reverberation ...
This study focuses on an unexplored aspect of the performance of algorithms for blind reverberation ...
The reverberation time, T60, is one of the key parameters used to quantify room acoustics. It can pr...
The reverberation time, T60, is one of the key parameters used to quantify room acoustics. It can pr...
An improved algorithm for the estimation of the reverberation time (RT) from reverberant speech sign...
The speech signal is inherently characterized by its variations in time, which get reflected as vari...
This paper proposes two methods for robust automatic speech recognition (ASR) in reverberant environ...
This paper introduces a novel set of non-linear spectro-temporal features that improve automatic spe...
Abstract—Perceptual models exploiting auditory masking are frequently used in audio and speech proce...