Paper accepted at the Language Resources and Evaluation Conference (LREC) 2020. This article describes the constitution process of the first morpho-syntactically annotated Tunisian Arabish Corpus (TArC). Arabish, also known as Arabizi, is a spontaneous coding of Arabic dialects in Latin characters and arithmographs (numbers used as letters). This code-system was developed by Arabic-speaking users of social media to facilitate writing in the informal settings of Computer-Mediated Communication (CMC) and text messaging. The realization of Arabish varies across dialects, and each Arabish code-system is under-resourced, as are most Arabic dialects. In the last few years, the focus...
The development of automatic processing tools for Arabic dialects is hampered by the abse...
A non-standard romanization of Arabic script, known as Arabizi, is widely used in Arabic online and S...
This paper presents a critical description of natural language processing for ...
This article describes the constitution process of the first morpho-syntactically annotated Tunisian...
This article describes the collection process of the first morpho-syntactically annotated Tunisian a...
In this paper we present the final result of a project on Tunisian Arabic enco...
This article describes the procedure for building the first corpus of Tunisian Arabish...
This article describes the procedure for building the first Tunisian Arabish corpus (TArC), annotated...
This dataset was created between 2017 and 2021 to provide a textual resource that can be used t...
The constitution of an oral corpus of Tunisian Arabic for the analysis of the ...
This paper presents preliminary results in building an annotated corpus of the Palestinian Arabic di...
This thesis deals with the creation of linguistic resources for spoken Tunisian Arabic. First, we descri...
The term corpus comes from Latin and means “body”. According to corpus linguists, a corpus can be de...
The sociolinguistic situation in Arabic countries is characterized by diglossia (Ferguson, 1959): wh...
Due to the rapid developments in technology and the sudden expansion of social media use, Dialect Ar...