The sud4science LR project 1 aimed at studying a fairly recent form of written communication: SMS (Short Message Service). The first step of the project was to collect a large number of text messages from the general public. We initially gathered 93'085 SMS and our final corpus, entitled 88milSMS , contains over 88'000 SMS. 2 In this article, we propose a novel approach (which is also applicable to other textual data) for classifying unknown items in 88milSMS , based on two steps: 1) Classification of SMS in relation to 5 European languages (French, Spanish, English, German, Italian), 2) Classification of unknown items accordi ng to predefined classes (schedules, items containing special character(s), number(s), words without accents, or wi...
L’objectif du projet « Pratiques contemporaines de la textualité numérique : observation, descriptio...
International audienceHandwriting is an alternative method for entering texts which composed Short M...
International audienceLa récolte de SMS dans le cadre du projet sud4science Languedoc-Roussillon. Mu...
The sud4science LR project1 aimed at studying a fairly recent form of written communication: SMS (Sh...
International audienceThe sud4science LR project (http://www.sud4science.org/) aimed at studying a f...
International audienceDepuis 2014, le corpus 88milSMS est disponible en téléchargement public (Panck...
This paper details an international project called sms4science that aims to collect text message cor...
International audienceIn this article, firstly we briefly summarise the sud4science project and data...
In this article, firstly we briefly summarise the sud4science project and data collection (http://su...
International audienceIn this article, firstly we briefly summarise the sud4science project and data...
This article highlights an approach based on authentic data, by focusing on recent research related ...
In this paper, our hypothesis is that the SMS register of the written language shares certain featur...
88milSMS est un corpus de plus de 88 000 SMS authentiques français, recueillis à Montpellier en 2011...
L’objectif du projet « Pratiques contemporaines de la textualité numérique : observation, descriptio...
International audienceHandwriting is an alternative method for entering texts which composed Short M...
International audienceLa récolte de SMS dans le cadre du projet sud4science Languedoc-Roussillon. Mu...
The sud4science LR project1 aimed at studying a fairly recent form of written communication: SMS (Sh...
International audienceThe sud4science LR project (http://www.sud4science.org/) aimed at studying a f...
International audienceDepuis 2014, le corpus 88milSMS est disponible en téléchargement public (Panck...
This paper details an international project called sms4science that aims to collect text message cor...
International audienceIn this article, firstly we briefly summarise the sud4science project and data...
In this article, firstly we briefly summarise the sud4science project and data collection (http://su...
International audienceIn this article, firstly we briefly summarise the sud4science project and data...
This article highlights an approach based on authentic data, by focusing on recent research related ...
In this paper, our hypothesis is that the SMS register of the written language shares certain featur...
88milSMS est un corpus de plus de 88 000 SMS authentiques français, recueillis à Montpellier en 2011...
L’objectif du projet « Pratiques contemporaines de la textualité numérique : observation, descriptio...
International audienceHandwriting is an alternative method for entering texts which composed Short M...
International audienceLa récolte de SMS dans le cadre du projet sud4science Languedoc-Roussillon. Mu...