This paper describes the process and the resources used to automatically annotate a French corpus of spontaneous speech transcriptions in super-chunks. Super-chunks are enhanced chunks that can contain lexical multiword units. This partial parsing is based on a pre-processing stage of the spoken data that consists in reformatting and tagging utterances that break the syntactic structure of the text, such as disfluencies. Spoken specificities were formalized thanks to a systematic linguistic study of a 40-hour-long speech transcription corpus. The chunker uses large-coverage and fine-grained language resources for general written language that have been augmented with resources specific to spoken French. It consists in iteratively applying f...
International audienceThe aim of this paper is to describe an automated process to segment spoken Fr...
The main objective of the Rhapsodie project (ANR Rhapsodie 07 Corp-030-01) was to define rich, expli...
This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is ba...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is ba...
This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is ba...
In this paper we propose a multi-step system for the semiautomatic detection and annotation of disfl...
Annotating spoken corpora poses unique challenges stemming from the particular characteristics of sp...
International audienceWe present in this paper a new system, MarsaTag, aiming at segmenting, tagging...
International audienceWe present in this paper a new system, MarsaTag, aiming at segmenting, tagging...
International audienceWe present in this paper a new system, MarsaTag, aiming at segmenting, tagging...
International audienceThe aim of this paper is to describe an automated process to segment spoken Fr...
The main objective of the Rhapsodie project (ANR Rhapsodie 07 Corp-030-01) was to define rich, expli...
International audienceThe aim of this paper is to describe an automated process to segment spoken Fr...
The main objective of the Rhapsodie project (ANR Rhapsodie 07 Corp-030-01) was to define rich, expli...
This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is ba...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper describes the process and the resources used to automatically annotate a French corpus of...
This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is ba...
This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is ba...
In this paper we propose a multi-step system for the semiautomatic detection and annotation of disfl...
Annotating spoken corpora poses unique challenges stemming from the particular characteristics of sp...
International audienceWe present in this paper a new system, MarsaTag, aiming at segmenting, tagging...
International audienceWe present in this paper a new system, MarsaTag, aiming at segmenting, tagging...
International audienceWe present in this paper a new system, MarsaTag, aiming at segmenting, tagging...
International audienceThe aim of this paper is to describe an automated process to segment spoken Fr...
The main objective of the Rhapsodie project (ANR Rhapsodie 07 Corp-030-01) was to define rich, expli...
International audienceThe aim of this paper is to describe an automated process to segment spoken Fr...
The main objective of the Rhapsodie project (ANR Rhapsodie 07 Corp-030-01) was to define rich, expli...
This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is ba...