International audienceThis paper describes the use of the CasSys platform in order to achieve the chunking of conversational speech transcripts by means of cascades of Unitex transducers. Our system is involved in the EPAC project of the French National agency of Research (ANR). The aim of this project is to develop robust methods for the annotation of audio/multimedia document collections which contains conversational speech sequences such as TV or radio programs. At first, this paper presents the EPAC project and the adaptation of a former chunking system (Romus) which was developed in the restricted framework of dedicated spoken man-machine dialogue. Then, it describes the problems that are arising due to 1) spontaneous speech disfluenci...
Interoperable annotation formats are fundamental to the utility, expansion, and sustainability of co...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
International audienceThis paper presents a corpus-based analysis of coreference and anaphoric relat...
International audienceThis paper describes the use of the CasSys platform in order to achieve the ch...
This paper describes the use of the CasSys platform in order to achieve the chunking of conversation...
This paper presents the EPAC corpus which is composed by a set of 100 hours of conversational speech...
International audienceThis paper presents the preliminary works to put online a French oral corpus a...
Le traitement automatique de la parole est un domaine qui englobe un grand nombre de travaux : de la...
This paper discusses a methodology for the processing of large amounts of speech data using database...
Annotating spoken corpora poses unique challenges stemming from the particular characteristics of sp...
This paper presents the preliminary works to put online a French oral corpus and its transcription. ...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper reports on the setup and evaluation of robust speech recognition system parts, geared tow...
Those latest decades, the development of information and communication technologies has substantiall...
International audienceSPPAS is a tool to produce automatic annotations which include utterance, word...
Interoperable annotation formats are fundamental to the utility, expansion, and sustainability of co...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
International audienceThis paper presents a corpus-based analysis of coreference and anaphoric relat...
International audienceThis paper describes the use of the CasSys platform in order to achieve the ch...
This paper describes the use of the CasSys platform in order to achieve the chunking of conversation...
This paper presents the EPAC corpus which is composed by a set of 100 hours of conversational speech...
International audienceThis paper presents the preliminary works to put online a French oral corpus a...
Le traitement automatique de la parole est un domaine qui englobe un grand nombre de travaux : de la...
This paper discusses a methodology for the processing of large amounts of speech data using database...
Annotating spoken corpora poses unique challenges stemming from the particular characteristics of sp...
This paper presents the preliminary works to put online a French oral corpus and its transcription. ...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper reports on the setup and evaluation of robust speech recognition system parts, geared tow...
Those latest decades, the development of information and communication technologies has substantiall...
International audienceSPPAS is a tool to produce automatic annotations which include utterance, word...
Interoperable annotation formats are fundamental to the utility, expansion, and sustainability of co...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
International audienceThis paper presents a corpus-based analysis of coreference and anaphoric relat...