We report on our ongoing work in developing the Irish Dependency Treebank, describe the results of two Inter-Annotator Agreement (IAA) studies, demonstrate improvements in annotation consistency that have a knock-on effect on parsing accuracy, and present the final set of dependency labels. We then investigate the extent to which active learning can play a role in treebank and parser development by comparing an active learning bootstrapping approach to a passive approach in which sentences are chosen at random for manual revision. We show that active learning outperforms passive learning, but when annotation effort is taken into account, it is not clear how much of an advantage the active learning approach has. Finally, we present...
In this paper we present a partial dependency parser for Irish, in which Constraint Grammar (CG) rul...
Universal Dependency (UD) annotations, despite their usefulness for cross-lingual tasks and semantic...
In natural language acquisition, it is difficult to gather the annotated data needed for supervised ...
Language resources are essential for linguistic research and the development of NLP applications. Lo...
Despite enjoying the status of an official EU language, Irish is considered a minority language. As ...
We present a number of semi-supervised parsing experiments on the Irish language carried out using a...
We present a study of cross-lingual direct transfer parsing for the Irish language. Firstly we disc...
Modern Irish is a minority language lacking sufficient computational resources for the task of accur...
We present a study that compares data-driven dependency parsers obtained by means of annotation proj...
Institute for Communicating and Collaborative Systems. Active learning reduces annotation costs for su...
Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora AEPC 2010. Editors:...
The Universal Dependencies Project (Nivre, [9]; Nivre et al., [10]) is an ongoing effort towards ...
Parsing is a step for understanding a natural language to find out about the words and their grammat...