Morphosyntactic tagging and syntactic parsing are key parts of natural language processing. Many systems now reach usable results for written French texts (Véronis, 2000; Clément, 2001), but there have been few attempts to automatically annotate spoken textual data (see, however, Mertens, 2002; Valli and Véronis, 1999). Indeed, existing software is ill-suited to analysing texts transcribed from speech and faces specific problems, all related to the nature of the data:
• for theoretical reasons (Blanche-Benveniste and Jeanjean, 1987), transcriptions of speech do not contain punctuation marks; nevertheless, most natural language processing tools rely on these marks to perform an initial segmentation of the text;
• texts inc...
The use of computer tools has led to major advances in the study of spoken lan...
In the area of large French speech corpora, there is a demonstrated need for a...
To transcribe speech, automatic speech recognition systems use statistical methods, particularly hid...
Researchers in the field of spoken text processing face specific problems, all...
Abstract. The aim of our paper is to study the interest of part of speech (POS) tagging to improve s...
Whereas it was common some years ago to formulate phonetic models on the basis...
Texts generated by automatic speech recognition (ASR) systems have some specificities, related to th...
This paper describes the process and the resources used to automatically annotate a French corpus of...
A finalised digital resource of 88,000 anonymised French text messages, the 88milSMS corpus, two ext...
We present in this paper a new system, MarsaTag, aiming at segmenting, tagging...
This paper addresses the problem of the enrichment of transcriptions in the pe...