The paper proposes a module for automatic punctuation prediction and casing reconstruction based on transformers architectures (BERT/T5) that constitutes the current state-of-the-art in many similar NLP tasks. The main motivation for our work was to increase the readability of the ASR output. The ASR output is usually in the form of a continuous stream of text, without punctuation marks and with all words in lowercase. The resulting punctuation and casing reconstruction module is evaluated on both the written text and the actual ASR output in three languages (English, Czech and Slovak)
Transformers have taken the centre stage for most NLP applications after LSTM’s were established as ...
Sentence unit detection in automated speech recognition (ASR) system is crucial for enriching the AS...
Natural language processing techniques are dependent upon punctuation to work well. When their input...
The paper proposes a module for automatic punctuation prediction and casing reconstruction based on ...
This thesis deals with the problem of punctuation reconstruction in the output of automatic speech r...
This paper presents a contribution to PolEval 2021 Task 1: Punctuation restoration from read text. T...
While speech recognition Word Error Rate (WER) has reached human parity for English, continuous spee...
This thesis focuses on the Swedish language, where punctuation restoration, especially as a postproc...
Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) outp...
This paper presents the work of restoring punctuation for ASR transcripts generated by multilingual ...
Punctuation prediction is an important task in Spoken Lan-guage Translation. The output of speech re...
This paper proposes a flexible approach for punctuation prediction that can be used to produce state...
We test a series of techniques to predict punctuation and its effect on machine translation (MT) qua...
Thesis (Ph. D.)--University of Washington, 2008.Increasing amounts of easily available electronic da...
International audienceThis papers aims at improving spoken language modeling (LM) using very large a...
Transformers have taken the centre stage for most NLP applications after LSTM’s were established as ...
Sentence unit detection in automated speech recognition (ASR) system is crucial for enriching the AS...
Natural language processing techniques are dependent upon punctuation to work well. When their input...
The paper proposes a module for automatic punctuation prediction and casing reconstruction based on ...
This thesis deals with the problem of punctuation reconstruction in the output of automatic speech r...
This paper presents a contribution to PolEval 2021 Task 1: Punctuation restoration from read text. T...
While speech recognition Word Error Rate (WER) has reached human parity for English, continuous spee...
This thesis focuses on the Swedish language, where punctuation restoration, especially as a postproc...
Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) outp...
This paper presents the work of restoring punctuation for ASR transcripts generated by multilingual ...
Punctuation prediction is an important task in Spoken Lan-guage Translation. The output of speech re...
This paper proposes a flexible approach for punctuation prediction that can be used to produce state...
We test a series of techniques to predict punctuation and its effect on machine translation (MT) qua...
Thesis (Ph. D.)--University of Washington, 2008.Increasing amounts of easily available electronic da...
International audienceThis papers aims at improving spoken language modeling (LM) using very large a...
Transformers have taken the centre stage for most NLP applications after LSTM’s were established as ...
Sentence unit detection in automated speech recognition (ASR) system is crucial for enriching the AS...
Natural language processing techniques are dependent upon punctuation to work well. When their input...