We studied the treebanks included in HamleDT and partially unified their label sets. Afterwards, we used a method based on variation n-grams to automatically detect errors in morphological and dependency annotation. Then we used the output of a part-of-speech tagger or a dependency parser trained on each treebank to correct the detected errors. The performance of both the detection and the correction of errors on both annotation levels was manually evaluated on randomly selected samples of suspected errors from several treebanks.
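To make the detection step concrete, the following is a minimal sketch of the variation n-gram idea for part-of-speech annotation. It is not the implementation evaluated here: it assumes a fixed context of one word on each side of the candidate token, whereas the actual method may handle context differently, and it does not cover dependency annotation or the correction step. The function name `variation_ngrams`, the `context` parameter, and the toy corpus are hypothetical and exist only for illustration.

```python
from collections import defaultdict


def variation_ngrams(sentences, context=1):
    """Find word n-grams whose middle word is tagged inconsistently.

    `sentences` is a list of sentences, each a list of (word, tag) pairs.
    Returns {word n-gram: {tag: count}} for every n-gram whose middle word
    receives more than one tag across the corpus. This is a simplified
    sketch with a fixed-width context (an assumption, not the paper's setup).
    """
    seen = defaultdict(lambda: defaultdict(int))
    for sent in sentences:
        # Pad with boundary symbols so that edge tokens also get a full window.
        words = ["<s>"] * context + [w for w, _ in sent] + ["</s>"] * context
        for i, (_, tag) in enumerate(sent):
            # Window of `context` words on each side of the i-th token.
            ngram = tuple(words[i : i + 2 * context + 1])
            seen[ngram][tag] += 1
    # Keep only n-grams where identical word context carries different tags.
    return {ng: dict(tags) for ng, tags in seen.items() if len(tags) > 1}


# Toy corpus, invented for illustration only.
corpus = [
    [("they", "PRON"), ("can", "AUX"), ("fish", "VERB")],
    [("they", "PRON"), ("can", "VERB"), ("fish", "NOUN")],
]
suspects = variation_ngrams(corpus)
for ngram, tags in suspects.items():
    print(" ".join(ngram), tags)
```

Each reported n-gram is only a suspect: the same words in the same context can be legitimately ambiguous, which is why detected cases still need a second signal (such as tagger output) or manual inspection before any label is changed.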
HamleDT (HArmonized Multi-LanguagE Dependency Treebank) is a compilation of existing dependency tree...
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank a...
This paper discusses an automatic, data-driven approach to treebank error detection. The approach ad...
Abstract. We present HamleDT – a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compila...
We present HamleDT – a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of ex...
Abstract. Treebanks play an important role in the development of various natural language processin...
This paper describes a statistical approach to detect annotation errors in dependency treebanks. As ...
Thesis Abstract Akshay Aggarwal July 2020 This thesis attempts at correction of some errors and inco...
This work describes how derivation tree fragments based on a variant of Tree Adjoining Grammar (TAG)...
We propose HamleDT – HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of exis...
This chapter presents a methodology for identifying and resolving various kinds of inconsistency in ...