Structure preserving grammar compaction (SPC) is a simple CFG compaction technique originally described in (van Genabith et al., 1999a, 1999b). It works by generalising category labels and in so doing plugs holes in the grammar. To date the method has been tested on small corpra only. In the present research we apply SPC to a large grammar extracted from the Penn Treebank and examine its effects on rule treebank grammar size and on rule accession rates (as an indicator of grammar completeness) . 1 Introduction Tree banks and resources compiled from treebanks are potentially very useful in NLP. Grammars extracted from treebanks --- so called treebank grammars (Charniak, 1996) --- can form the basis of large coverage NLP systems. Such treeban...
This paper describes Grammar Learning by Partition Search, a general method for automatically constr...
In this dissertation I investigate ways to extend the annotation of treebanks, or parsed corpora, by...
We present a method for automatically annotating treebank resources with functional structures. The ...
Structure preserving grammar compaction (SPC) is a simple CFG compaction technique originally descri...
Treebanks, such as the Penn Treebank, provide a basis for the automatic creation of broad coverage g...
Manual, large scale (computational) grammar development is time consuming, expensive and requires lo...
This paper presents empirical studies and closely corresponding theoretical models of the performanc...
Manual development of deep linguistic resources is time-consuming and costly and therefore often des...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
Developing large-scale deep grammars in a constraint-based framework such as Lexical Functional Gram...
The development of large coverage, rich unification- (constraint-) based grammar resources is very t...
This paper describes how electronic grammars can be further enhanced by adding machine-readable gram...
We present a Bayesian nonparametric model for estimating tree insertion grammars (TIG), building upo...
The trees in the Penn Treebank have a standard representation that involves complete balanced bracke...
Grammars are core elements of many NLP applications. Grammars can be developed in two ways: built by...
This paper describes Grammar Learning by Partition Search, a general method for automatically constr...
In this dissertation I investigate ways to extend the annotation of treebanks, or parsed corpora, by...
We present a method for automatically annotating treebank resources with functional structures. The ...
Structure preserving grammar compaction (SPC) is a simple CFG compaction technique originally descri...
Treebanks, such as the Penn Treebank, provide a basis for the automatic creation of broad coverage g...
Manual, large scale (computational) grammar development is time consuming, expensive and requires lo...
This paper presents empirical studies and closely corresponding theoretical models of the performanc...
Manual development of deep linguistic resources is time-consuming and costly and therefore often des...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
Developing large-scale deep grammars in a constraint-based framework such as Lexical Functional Gram...
The development of large coverage, rich unification- (constraint-) based grammar resources is very t...
This paper describes how electronic grammars can be further enhanced by adding machine-readable gram...
We present a Bayesian nonparametric model for estimating tree insertion grammars (TIG), building upo...
The trees in the Penn Treebank have a standard representation that involves complete balanced bracke...
Grammars are core elements of many NLP applications. Grammars can be developed in two ways: built by...
This paper describes Grammar Learning by Partition Search, a general method for automatically constr...
In this dissertation I investigate ways to extend the annotation of treebanks, or parsed corpora, by...
We present a method for automatically annotating treebank resources with functional structures. The ...