In this thesis, we investigate methods for automatic detection, and to some extent correction, of grammatical errors. The evaluation is based on manual error annotation in the Cambridge Learner Corpus (CLC), and automatic or semi-automatic annotation of error corpora is one possible application, but the methods are also applicable in other settings, for instance to give learners feedback on their writing or in a proofreading tool used to prepare texts for publication. Apart from the CLC, we use the British National Corpus (BNC) to get a better model of correct usage, WordNet for semantic relations, other machine-readable dictionaries for orthography/morphology, and the Robust Accurate Statistical Parsing (RASP) system to parse both the CLC ...
The Constituent Likelihood Automatic Word-tagging System (CLAWS) was originally designed for the low...
We investigate grammatical error detec-tion in spoken language, and present a data-driven method to ...
Annotated data is an essential ingredient in natural language processing for training and evaluating...
Grammatical Error Correction (GEC) is the task of automatically detecting and correcting grammatical...
We describe the collection of transcription corrections and grammatical error annotations for the Cr...
International audienceIn this paper, we address the question of automatic annotation of English lear...
Traditionally, English grammatical error checking is done by English language professionals. However...
Learning a foreign language requires much practice outside of the classroom. Computer-assisted langu...
Shortage of available training data is holding back progress in the area of automated error detectio...
This work focuses on designing a grammar detection system that understands both structural and conte...
This paper explores the issue of automatically generated ungrammatical data and its use in error det...
The paper describes a corpus of texts produced by non-native speakersof Czech. We discuss its annota...
The demand for computer-assisted language learning systems that can provide corrective feedback on l...
ABSTRACT Many evaluation issues for grammatical error detection have previously been overlooked, mak...
English as a Second Language (ESL) learners ’ writings contain various grammatical errors. Pre-vious...
The Constituent Likelihood Automatic Word-tagging System (CLAWS) was originally designed for the low...
We investigate grammatical error detec-tion in spoken language, and present a data-driven method to ...
Annotated data is an essential ingredient in natural language processing for training and evaluating...
Grammatical Error Correction (GEC) is the task of automatically detecting and correcting grammatical...
We describe the collection of transcription corrections and grammatical error annotations for the Cr...
International audienceIn this paper, we address the question of automatic annotation of English lear...
Traditionally, English grammatical error checking is done by English language professionals. However...
Learning a foreign language requires much practice outside of the classroom. Computer-assisted langu...
Shortage of available training data is holding back progress in the area of automated error detectio...
This work focuses on designing a grammar detection system that understands both structural and conte...
This paper explores the issue of automatically generated ungrammatical data and its use in error det...
The paper describes a corpus of texts produced by non-native speakersof Czech. We discuss its annota...
The demand for computer-assisted language learning systems that can provide corrective feedback on l...
ABSTRACT Many evaluation issues for grammatical error detection have previously been overlooked, mak...
English as a Second Language (ESL) learners ’ writings contain various grammatical errors. Pre-vious...
The Constituent Likelihood Automatic Word-tagging System (CLAWS) was originally designed for the low...
We investigate grammatical error detec-tion in spoken language, and present a data-driven method to ...
Annotated data is an essential ingredient in natural language processing for training and evaluating...