The present work illustrates the main results of an experiment on errors and repairs in spoken language transcription, with significant relevance for the evaluation of validity, reliability and correctness of transcriptions of speech belonging to several different typologies, set for the annotation of spoken corpora. In particular, we dealt with errors and repair strategies that appear on the first drafts of the transcription process, that are not easily detectable with automatic post-editing procedures. 20 participants were asked to give an accurate transcription of 22 short utterances, repeated from one to four times, belonging two non-spontaneous (10) and spontaneous conversation (10). Error analysis suggest a general preference for mean...
This paper describes the transcription process and the development of transcription skills in a rese...
The papers brought together in this volume illustrate how spoken corpora (be they native or learner ...
International audienceThe aim of our paper is to study the interest of part of speech (POS) tagging ...
Transcription of spoken language is an ordinary practice in modern linguistics (particularly in corp...
Transcription of speech is often driven by different transc ribers’ understanding strategies, leadin...
We describe the collection of transcription corrections and grammatical error annotations for the Cr...
Making accurate verbatim transcriptions is very time consuming and in the case of extemporaneous spe...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Abstract. The aim of our paper is to study the interest of part of speech (POS) tagging to improve s...
Spoken discourse is a uniquely valuable source of data in cognitive research. A natural way of repre...
We describe an efficient procedure for automatic repair of quickly transcribed (QT) speech. QT speec...
There is an enormous amount of recorded speech generated daily, and quickly transcribing and analyzi...
Speech recognition technology suffers from a lack of robustness which limits its usability for fully...
For research and development purposes in the areas of phonetics and speech technology, phonetically ...
Transcriptions of speech which aim to show the speaker’s intonation are not sufficiently reliable to...
This paper describes the transcription process and the development of transcription skills in a rese...
The papers brought together in this volume illustrate how spoken corpora (be they native or learner ...
International audienceThe aim of our paper is to study the interest of part of speech (POS) tagging ...
Transcription of spoken language is an ordinary practice in modern linguistics (particularly in corp...
Transcription of speech is often driven by different transc ribers’ understanding strategies, leadin...
We describe the collection of transcription corrections and grammatical error annotations for the Cr...
Making accurate verbatim transcriptions is very time consuming and in the case of extemporaneous spe...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Abstract. The aim of our paper is to study the interest of part of speech (POS) tagging to improve s...
Spoken discourse is a uniquely valuable source of data in cognitive research. A natural way of repre...
We describe an efficient procedure for automatic repair of quickly transcribed (QT) speech. QT speec...
There is an enormous amount of recorded speech generated daily, and quickly transcribing and analyzi...
Speech recognition technology suffers from a lack of robustness which limits its usability for fully...
For research and development purposes in the areas of phonetics and speech technology, phonetically ...
Transcriptions of speech which aim to show the speaker’s intonation are not sufficiently reliable to...
This paper describes the transcription process and the development of transcription skills in a rese...
The papers brought together in this volume illustrate how spoken corpora (be they native or learner ...
International audienceThe aim of our paper is to study the interest of part of speech (POS) tagging ...