A classification method is presented that detects the presence of speech embedded in a real acoustic background of non-speech sounds. Features used for classification are modulation com-ponents extracted by computation of the amplitude modulation spectrogram. Feature selection techniques and support vector classification are employed to identify modulation components most salient for the classification task and therefore considered as highly characteristic for speech. Results show that reliable detection of speech can be performed with less than 10 opti-mally selected modulation features, the most important ones are located in the modulation frequency range below 10 Hz. Detec-tion of speech in a background of non-speech signals is per-forme...
Performance of automatic speech recognition relies on a vast amount of training speech data mostly r...
We describe a content-based audio classification algorithm based on novel multiscale spectro-tempora...
In this paper, a feature set derived from modulation spectra is applied to the task of detecting sin...
Robust detection of speech embedded in real acoustic back-ground noise is considered using an approa...
In this work, we adopt an information theoretic approach- the Information Bottleneck method- to extr...
The need for efficient, sophisticated features for speech event detection is inherent in state of th...
In this work, we adopt an information theoretic approach- the Information Bottleneck method- to extr...
This paper shows an effective speech/non-speech discrimination method for improving the performance ...
In this contribution, an acoustic event detection system based on spectro-temporal features and a tw...
Speech recognition by machines is an important technology for the 21st century. Speech signals are p...
The accurate extraction of two particular features from the speech signal affected by additive white...
In this paper, we combine modulation spectral features with mel-frequency cepstral coefficients for ...
Speech boundary detection contributes to performance of speech based applications such as speech rec...
Voice activity detection (VAD) aims at identifying presence of speech in a noisy signal. For this pu...
This thesis describes techniques for voice activity detection in audio recordings. It is necessary t...
Performance of automatic speech recognition relies on a vast amount of training speech data mostly r...
We describe a content-based audio classification algorithm based on novel multiscale spectro-tempora...
In this paper, a feature set derived from modulation spectra is applied to the task of detecting sin...
Robust detection of speech embedded in real acoustic back-ground noise is considered using an approa...
In this work, we adopt an information theoretic approach- the Information Bottleneck method- to extr...
The need for efficient, sophisticated features for speech event detection is inherent in state of th...
In this work, we adopt an information theoretic approach- the Information Bottleneck method- to extr...
This paper shows an effective speech/non-speech discrimination method for improving the performance ...
In this contribution, an acoustic event detection system based on spectro-temporal features and a tw...
Speech recognition by machines is an important technology for the 21st century. Speech signals are p...
The accurate extraction of two particular features from the speech signal affected by additive white...
In this paper, we combine modulation spectral features with mel-frequency cepstral coefficients for ...
Speech boundary detection contributes to performance of speech based applications such as speech rec...
Voice activity detection (VAD) aims at identifying presence of speech in a noisy signal. For this pu...
This thesis describes techniques for voice activity detection in audio recordings. It is necessary t...
Performance of automatic speech recognition relies on a vast amount of training speech data mostly r...
We describe a content-based audio classification algorithm based on novel multiscale spectro-tempora...
In this paper, a feature set derived from modulation spectra is applied to the task of detecting sin...