This work assesses different approaches for speech and non-speech segmentation of audio data and proposes a new, high-level representation of audio signals based on phoneme recognition features suitable for speech/non-speech discrimination tasks. Unlike previous model-based approaches, where speech and non-speech classes were usually modeled by several models, we develop a representation where just one model per class is used in the segmentation process. For this purpose, four measures based on consonant-vowel pairs obtained from different phoneme speech recognizers are introduced and applied in two different segmentation-classification frameworks. The segmentation systems were evaluated on different broadcast news databases. The evaluatio...
Speech recognition system requires segmentation of speech waveform into fundamental acoustic units. ...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
Segment based direct models have recently been used to im-prove the output of existing state-of-the-...
This paper investigates the issue of automatic segmentation of speech recordings for broadcast news ...
This article talks about two majors ways of performing a speech/music segmentation task. The first o...
In this thesis, research on large vocabulary continuous speech recognition for unknown audio conditi...
This paper presents a new approach to phoneme recognition using nonsequential sub--phoneme units. Th...
This paper presents a new approach to phoneme recognition using nonsequential sub-phoneme units. The...
Robust acoustic segmentation has become a critical issue in order to apply speech recognition to aud...
The human auditory system is very well matched to both hu-man speech and environmental sounds. There...
A probabilistic and statistical framework is presented for automatic speech recognition based on a p...
In this thesis, we study the segmentation of an audio stream in speech, music and speech on music (S...
Audio segmentation is important as a pre-processing task to improve the performance of many speech t...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
In this thesis, the use of multiple acoustic features of the speech signal is considered for speech ...
Speech recognition system requires segmentation of speech waveform into fundamental acoustic units. ...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
Segment based direct models have recently been used to im-prove the output of existing state-of-the-...
This paper investigates the issue of automatic segmentation of speech recordings for broadcast news ...
This article talks about two majors ways of performing a speech/music segmentation task. The first o...
In this thesis, research on large vocabulary continuous speech recognition for unknown audio conditi...
This paper presents a new approach to phoneme recognition using nonsequential sub--phoneme units. Th...
This paper presents a new approach to phoneme recognition using nonsequential sub-phoneme units. The...
Robust acoustic segmentation has become a critical issue in order to apply speech recognition to aud...
The human auditory system is very well matched to both hu-man speech and environmental sounds. There...
A probabilistic and statistical framework is presented for automatic speech recognition based on a p...
In this thesis, we study the segmentation of an audio stream in speech, music and speech on music (S...
Audio segmentation is important as a pre-processing task to improve the performance of many speech t...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
In this thesis, the use of multiple acoustic features of the speech signal is considered for speech ...
Speech recognition system requires segmentation of speech waveform into fundamental acoustic units. ...
Statistical data-driven methods and knowledge-based methods are two recent trends in Automatic Speec...
Segment based direct models have recently been used to im-prove the output of existing state-of-the-...