We present the general architecture of the error annotation system applied to the COPLE2 corpus, a learner corpus of Portuguese implemented on the TEITOK platform. We give a general overview of the corpus and of the TEITOK functionalities and describe how the error annotation is structured in a two-level system: first, a fully manual token-based and coarse-grained annotation is applied and produces a rough classification of the errors in three categories, paired with multi-level information for POS and lemma; second, a multi-word and fine-grained annotation in standoff is then semi-automatically produced based on the first level of annotation. The token-based level has been applied to 47% of the total corpus. We compare our system with othe...
This paper describes an ongoing project in which we are collecting a learner corpus of Arabic, devel...
Dissertação (mestrado) - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão.P...
While automatically computing numerical scores remains the dominant paradigm in NLP system evaluatio...
We present the general architecture of the error annotation system applied to the COPLE2 corpus, a l...
We present the error tagging system of the COPLE2 corpus and the first results of its implementation...
In this article, we present COPLE2, a new corpus of Portuguese that encompasses written and spoken d...
We present the COPLE2 corpus, a learner corpus of Portuguese that includes written and spoken texts ...
International audienceIn this paper, we address the question of automatic annotation of English lear...
In this thesis, we investigate methods for automatic detection, and to some extent correction, of gr...
Error coding of second-language learner text, that is, detecting, correcting and annotating errors, ...
We present a freely available corpus containing source language texts from different domains along w...
The article explores the possibility of adopting a form-to-function perspective when annotating lear...
Annotating a corpus with error information is a challenging task. This paper describes the design, e...
A survey of the literature shows that annotating errors of Arabic learners has not received much att...
The present study analyses the errors identified in the written argumentative texts of 304 Spanish u...
This paper describes an ongoing project in which we are collecting a learner corpus of Arabic, devel...
Dissertação (mestrado) - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão.P...
While automatically computing numerical scores remains the dominant paradigm in NLP system evaluatio...
We present the general architecture of the error annotation system applied to the COPLE2 corpus, a l...
We present the error tagging system of the COPLE2 corpus and the first results of its implementation...
In this article, we present COPLE2, a new corpus of Portuguese that encompasses written and spoken d...
We present the COPLE2 corpus, a learner corpus of Portuguese that includes written and spoken texts ...
International audienceIn this paper, we address the question of automatic annotation of English lear...
In this thesis, we investigate methods for automatic detection, and to some extent correction, of gr...
Error coding of second-language learner text, that is, detecting, correcting and annotating errors, ...
We present a freely available corpus containing source language texts from different domains along w...
The article explores the possibility of adopting a form-to-function perspective when annotating lear...
Annotating a corpus with error information is a challenging task. This paper describes the design, e...
A survey of the literature shows that annotating errors of Arabic learners has not received much att...
The present study analyses the errors identified in the written argumentative texts of 304 Spanish u...
This paper describes an ongoing project in which we are collecting a learner corpus of Arabic, devel...
Dissertação (mestrado) - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão.P...
While automatically computing numerical scores remains the dominant paradigm in NLP system evaluatio...