This paper deals with automatic sentence boundary detection in spoken Czech using both textual and prosodic information. This task is important to make automatic speech recognition (ASR) output more readable and easier for downstream language processing modules. We compare and combine three statistical models – hidden Markov model, maximum entropy, and adaptive boosting. We evaluate these methods on two Czech corpora, broadcast news and broadcast conversations, using both manual and ASR transcripts. Our results show that superior results are achieved when all the three models are combined via posterior probability interpolation, and that there is substantial difference among the three methods when using different knowledge sources, as well ...
The bachelor thesis focuses on basic pre-processing (tokenization and segmentation) of Czech texts, ...
Prosodic boundaries in speech are of great relevance to both speech synthesis and audio annotation. ...
This paper reports our initial experiments with automatic punctuation annotation from speech. We hav...
This paper deals with automatic sentence boundary detection in spoken Czech using both textual and p...
Automatic sentence segmentation of speech is important for enriching speech recognition output and a...
Abstract: This article presents a cross-lingual study for agglutinative, fixed stressed languages, l...
Although speech recognition technology has significantly improved during the past few decades, curre...
We compare and contrast two different models for detecting sentence-like units in continuous speech,...
We investigate genre effects on the task of automatic sentence segmentation, focusing on two import...
We explore the use of prosodic features beyond pauses, including duration, pitch, and energy feature...
This paper deals with the automatic segmentation for Czech Concatenative speech synthesis. Statistic...
In this work we aim at enriching the transcript of an automatic speech recognition system with punct...
Abstract. This paper deals with the automatic segmentation for Czech Concatenative speech synthesis....
This paper deals with the problems of automatic segmentation for the purposes of Czech concatenative...
This paper presents a two-step method of automatic prosodic boundary detection using both textual an...
The bachelor thesis focuses on basic pre-processing (tokenization and segmentation) of Czech texts, ...
Prosodic boundaries in speech are of great relevance to both speech synthesis and audio annotation. ...
This paper reports our initial experiments with automatic punctuation annotation from speech. We hav...
This paper deals with automatic sentence boundary detection in spoken Czech using both textual and p...
Automatic sentence segmentation of speech is important for enriching speech recognition output and a...
Abstract: This article presents a cross-lingual study for agglutinative, fixed stressed languages, l...
Although speech recognition technology has significantly improved during the past few decades, curre...
We compare and contrast two different models for detecting sentence-like units in continuous speech,...
We investigate genre effects on the task of automatic sentence segmentation, focusing on two import...
We explore the use of prosodic features beyond pauses, including duration, pitch, and energy feature...
This paper deals with the automatic segmentation for Czech Concatenative speech synthesis. Statistic...
In this work we aim at enriching the transcript of an automatic speech recognition system with punct...
Abstract. This paper deals with the automatic segmentation for Czech Concatenative speech synthesis....
This paper deals with the problems of automatic segmentation for the purposes of Czech concatenative...
This paper presents a two-step method of automatic prosodic boundary detection using both textual an...
The bachelor thesis focuses on basic pre-processing (tokenization and segmentation) of Czech texts, ...
Prosodic boundaries in speech are of great relevance to both speech synthesis and audio annotation. ...
This paper reports our initial experiments with automatic punctuation annotation from speech. We hav...