In recent years, error mining approaches were developed to help identify the most likely sources of parsing failures in parsing sys-tems using handcrafted grammars and lexi-cons. However the techniques they use to enu-merate and count n-grams builds on the se-quential nature of a text corpus and do not eas-ily extend to structured data. In this paper, we propose an algorithm for mining trees and ap-ply it to detect the most likely sources of gen-eration failure. We show that this tree mining algorithm permits identifying not only errors in the generation system (grammar, lexicon) but also mismatches between the structures contained in the input and the input structures expected by our generator as well as a few id-iosyncrasies/error in the ...
One of the advantages of deep grammars, such as those based on HPSG, is that they can be used for ge...
This paper explores the issue of automatically generated ungrammatical data and its use in error det...
Inducing a grammar directly from text is one of the oldest and most challenging tasks in Computation...
This paper describes a statistical approach to detect annotation errors in dependency treebanks. As ...
International audienceWe introduce an error mining technique for automatically detecting errors in r...
Abstract. Treebanks play an important role in the development of var-ious natural language processin...
We present a method for detecting annotation errors in manually and automatically annotated dependen...
State-of-art systems for grammar error correction often correct errors based on word sequences or ph...
Shortage of available training data is holding back progress in the area of automated error detectio...
This paper discusses an automatic, data-driven approach to treebank error detection. The approach ad...
Abstract — Understanding the causes for failure is one of the bottlenecks in the educational process...
This paper describes how a treebank of ungrammatical sentences can be created from a treebank of we...
In this thesis, we investigate methods for automatic detection, and to some extent correction, of gr...
We investigate grammatical error detec-tion in spoken language, and present a data-driven method to ...
This work describes how derivation tree fragments based on a variant of Tree Adjoining Grammar (TAG)...
One of the advantages of deep grammars, such as those based on HPSG, is that they can be used for ge...
This paper explores the issue of automatically generated ungrammatical data and its use in error det...
Inducing a grammar directly from text is one of the oldest and most challenging tasks in Computation...
This paper describes a statistical approach to detect annotation errors in dependency treebanks. As ...
International audienceWe introduce an error mining technique for automatically detecting errors in r...
Abstract. Treebanks play an important role in the development of var-ious natural language processin...
We present a method for detecting annotation errors in manually and automatically annotated dependen...
State-of-art systems for grammar error correction often correct errors based on word sequences or ph...
Shortage of available training data is holding back progress in the area of automated error detectio...
This paper discusses an automatic, data-driven approach to treebank error detection. The approach ad...
Abstract — Understanding the causes for failure is one of the bottlenecks in the educational process...
This paper describes how a treebank of ungrammatical sentences can be created from a treebank of we...
In this thesis, we investigate methods for automatic detection, and to some extent correction, of gr...
We investigate grammatical error detec-tion in spoken language, and present a data-driven method to ...
This work describes how derivation tree fragments based on a variant of Tree Adjoining Grammar (TAG)...
One of the advantages of deep grammars, such as those based on HPSG, is that they can be used for ge...
This paper explores the issue of automatically generated ungrammatical data and its use in error det...
Inducing a grammar directly from text is one of the oldest and most challenging tasks in Computation...