Unrehearsed spoken language often contains many disfluencies. If we want to correctly interpret the content of spoken language, we need to be able to detect these disfluencies and deal with them appropriately. In the work de-scribed here, we use a statistical noisy channel model to detect disfluencies in transcripts of spoken language. Like all statistical approaches, this is natu-rally very data-hungry; however, cor-pora containing transcripts of unre-hearsed spoken language with disflu-encies annotated are a scarce resource, which makes training difficult. We address this issue in the follow-ing ways: First, since written textual corpora are much more abundant than speech corpora, we see whether using a large text corpus to increase the d...
International audienceResearchers in the field of spoken text processing face specific problems, all...
International audienceState-of-the art Spoken Language Understanding models of Spoken Dialog Systems...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
Unrehearsed spoken language often contains disfluencies. In order to correctly interpret a spoken ut...
Unrehearsed spoken language often contains disfluencies. In order to correctly interpret a spoken ut...
Previous approaches to detecting and correcting speech repairs have for the most part separated thes...
We propose a novel algorithm to detect disfluency in speech by reformulating the problem as phrase-l...
Spoken language 'grammatical error correction' (GEC) is an important mechanism to help learners of a...
Theoretical thesis.Bibliography: pages 43-46.1. Introduction -- 2. Literature review -- 3. LSTM nois...
This paper analyses speech repair clues in spontaneous speech in the MICASE corpus. An algorithm for...
In this paper we present a system which automatically cor-rects disfluencies such as repairs and res...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
In this paper we propose a multi-step system for the semiautomatic detection and annotation of disfl...
In automatic speech recognition, a statistical language model (LM) predicts the probability of the n...
International audienceResearchers in the field of spoken text processing face specific problems, all...
International audienceResearchers in the field of spoken text processing face specific problems, all...
International audienceState-of-the art Spoken Language Understanding models of Spoken Dialog Systems...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
Unrehearsed spoken language often contains disfluencies. In order to correctly interpret a spoken ut...
Unrehearsed spoken language often contains disfluencies. In order to correctly interpret a spoken ut...
Previous approaches to detecting and correcting speech repairs have for the most part separated thes...
We propose a novel algorithm to detect disfluency in speech by reformulating the problem as phrase-l...
Spoken language 'grammatical error correction' (GEC) is an important mechanism to help learners of a...
Theoretical thesis.Bibliography: pages 43-46.1. Introduction -- 2. Literature review -- 3. LSTM nois...
This paper analyses speech repair clues in spontaneous speech in the MICASE corpus. An algorithm for...
In this paper we present a system which automatically cor-rects disfluencies such as repairs and res...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
In this paper we propose a multi-step system for the semiautomatic detection and annotation of disfl...
In automatic speech recognition, a statistical language model (LM) predicts the probability of the n...
International audienceResearchers in the field of spoken text processing face specific problems, all...
International audienceResearchers in the field of spoken text processing face specific problems, all...
International audienceState-of-the art Spoken Language Understanding models of Spoken Dialog Systems...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...