We present a large, free, French corpus of online written conversations extracted from the Ubuntu platform's forums, mailing lists and IRC channels. The corpus is meant to support multi-modality and diachronic studies of online written conversations. We choose to build the corpus around a robust metadata model based upon strong principles, such as the "stand off" annotation principle. We detail the model, explain how the data was collected and processed (in terms of metadata, text and conversation), and describe the corpus's contents through a series of meaningful statistics. A portion of the corpus (about 4,700 sentences from emails, forum posts and chat messages sent in November 2014) is annotated in terms of dialogue...
The Greek Fragment of the WikiConv dataset. See: https://arxiv.org/abs/1810.13181 Abstract: We prese...
Technical online chats distinguish themselves from more usual natural language...
This paper describes a corpus of situated multiparty chats developed for the STAC project (Strategic...
In internet chatrooms, multiple conversations may occur simultaneously. The ta...
We describe the acquisition of a dialog corpus for French based on multi-task human-machine interact...
In this paper, we describe our experience with collecting and creating an annotated corpus of multi-...
This paper describes the ANNODIS resource, a discourse-level annotated corpus ...
The CoMeRe project aims to build a kernel corpus of different Computer-Mediated Communication (CMC)...
We are interested in problem-solving online written conversations. These conversations may be found ...
We present in this article a French chat corpus, intended for the study of cha...
Final version to Special Issue of JLCL (Journal of Language Technology and Computational Linguistics...