Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, speech rate, pitch intonation that influence placing of punctuation marks on speech transcripts have been seldom used. We propose a method that uses recurrent neural networks, taking prosodic and lexical information into account in order to predict punctuation marks for raw ASR output. Our experiments show that an attention mechanism over parallel sequences of prosodic cues aligned with transcribed speech improves accuracy of punctuation generation.We would like to thank Francesco Barbieri for offering his technical insigh...
Prosody is a kind of cues that are critical to human speech perception and comprehension, so it is p...
The paper proposes a module for automatic punctuation prediction and casing reconstruction based on ...
We test a series of techniques to predict punctuation and its effect on machine translation (MT) qua...
This thesis deals with the problem of punctuation reconstruction in the output of automatic speech r...
Comunicació presentada a: Interspeech 2018, celebrat del 2 al 6 de setembre de 2018 a Hyderabad, Índ...
Thesis (Master's)--University of Washington, 2022Clarity and precision of written text benefits from...
While speech recognition Word Error Rate (WER) has reached human parity for English, continuous spee...
Natural language processing techniques are dependent upon punctuation to work well. When their input...
Conventional automatic speech recognition systems do not produce punctuation marks which are importa...
In this dissertation, I study the inclusion of prosody into two applications that involve speech und...
Prosodic breaks prediction from text is a fundamental task to obtain naturalness in text to speech a...
International audienceThis paper presents a study of the links between punctuation and automatically...
Punctuation prediction is an important task in Spoken Lan-guage Translation. The output of speech re...
This paper is about the development of statistical models of prosodic features to generate linguisti...
This thesis focuses on the Swedish language, where punctuation restoration, especially as a postproc...
Prosody is a kind of cues that are critical to human speech perception and comprehension, so it is p...
The paper proposes a module for automatic punctuation prediction and casing reconstruction based on ...
We test a series of techniques to predict punctuation and its effect on machine translation (MT) qua...
This thesis deals with the problem of punctuation reconstruction in the output of automatic speech r...
Comunicació presentada a: Interspeech 2018, celebrat del 2 al 6 de setembre de 2018 a Hyderabad, Índ...
Thesis (Master's)--University of Washington, 2022Clarity and precision of written text benefits from...
While speech recognition Word Error Rate (WER) has reached human parity for English, continuous spee...
Natural language processing techniques are dependent upon punctuation to work well. When their input...
Conventional automatic speech recognition systems do not produce punctuation marks which are importa...
In this dissertation, I study the inclusion of prosody into two applications that involve speech und...
Prosodic breaks prediction from text is a fundamental task to obtain naturalness in text to speech a...
International audienceThis paper presents a study of the links between punctuation and automatically...
Punctuation prediction is an important task in Spoken Lan-guage Translation. The output of speech re...
This paper is about the development of statistical models of prosodic features to generate linguisti...
This thesis focuses on the Swedish language, where punctuation restoration, especially as a postproc...
Prosody is a kind of cues that are critical to human speech perception and comprehension, so it is p...
The paper proposes a module for automatic punctuation prediction and casing reconstruction based on ...
We test a series of techniques to predict punctuation and its effect on machine translation (MT) qua...