Unrehearsed spoken language often contains disfluencies. In order to correctly interpret a spoken utterance, any such disfluencies must be identified and removed or otherwise dealt with. Operating on transcripts of speech which contain disfluencies, we study the effect of language model and loss function on the performance of a linear reranker that rescores the 25-best output of a noisychannel model. We show that language models trained on large amounts of non-speech data improve performance more than a languagemodel trained on amoremodest amount of speech data, and that optimising f-score rather than log loss improves disfluency detection performance. Our approach uses a log-linear reranker, operating on the top n analyses of a noisy chann...
In this paper we present a new statistical model, which de-scribes the corruption to speech recognit...
While Transformer-based text classifiers pre-trained on large volumes of text have yielded significa...
While Transformer-based text classifiers pre-trained on large volumes of text have yielded significa...
Unrehearsed spoken language often contains many disfluencies. If we want to correctly interpret the ...
Theoretical thesis.Bibliography: pages 43-46.1. Introduction -- 2. Literature review -- 3. LSTM nois...
Unrehearsed spoken language often contains disfluencies. In order to correctly interpret a spoken ut...
We propose a novel algorithm to detect disfluency in speech by reformulating the problem as phrase-l...
Spoken language 'grammatical error correction' (GEC) is an important mechanism to help learners of a...
In automatic speech recognition, a stochastic language model (LM) predicts the probability of the ne...
The SOTA in transcription of disfluent and conversational speech has in recent years favored two-sta...
Speech plays a vital role in communication, from expressing oneself, to utilizing speech-based platf...
Speech plays a vital role in communication, from expressing oneself, to utilizing speech-based platf...
In automatic speech recognition, a statistical language model (LM) predicts the probability of the n...
Abstract. Previous research has shown that speech disfluencies- speech errors that occur in spoken l...
Disfluency detection is a critical task in real-time dialogue systems. However, despite its importan...
In this paper we present a new statistical model, which de-scribes the corruption to speech recognit...
While Transformer-based text classifiers pre-trained on large volumes of text have yielded significa...
While Transformer-based text classifiers pre-trained on large volumes of text have yielded significa...
Unrehearsed spoken language often contains many disfluencies. If we want to correctly interpret the ...
Theoretical thesis.Bibliography: pages 43-46.1. Introduction -- 2. Literature review -- 3. LSTM nois...
Unrehearsed spoken language often contains disfluencies. In order to correctly interpret a spoken ut...
We propose a novel algorithm to detect disfluency in speech by reformulating the problem as phrase-l...
Spoken language 'grammatical error correction' (GEC) is an important mechanism to help learners of a...
In automatic speech recognition, a stochastic language model (LM) predicts the probability of the ne...
The SOTA in transcription of disfluent and conversational speech has in recent years favored two-sta...
Speech plays a vital role in communication, from expressing oneself, to utilizing speech-based platf...
Speech plays a vital role in communication, from expressing oneself, to utilizing speech-based platf...
In automatic speech recognition, a statistical language model (LM) predicts the probability of the n...
Abstract. Previous research has shown that speech disfluencies- speech errors that occur in spoken l...
Disfluency detection is a critical task in real-time dialogue systems. However, despite its importan...
In this paper we present a new statistical model, which de-scribes the corruption to speech recognit...
While Transformer-based text classifiers pre-trained on large volumes of text have yielded significa...
While Transformer-based text classifiers pre-trained on large volumes of text have yielded significa...