This chapter presents a methodology for identifying and resolving various kinds of inconsistency in the context of merging dependency and multiword expression (MWE) annotations, to generate a dependency treebank with comprehensive MWE annotations. Candidates for correction are identified using a variety of heuristics, including an entirely novel one which identifies violations of MWE constituency in the dependency tree, and resolved by arbitration with minimal human intervention. Using this technique, we identified and corrected several hundred inconsistencies across both parse and MWE annotations, representing changes to a significant percentage (well over 10\%) of the MWE instances in the joint corpus and a large difference in MWE tagging...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This thesis explores annotation of multiword expressions in the Prague Dependency Treebank 2.0. We e...
International audienceIn this paper, we investigate various strategies to predict both syntactic dep...
International audienceThis paper discusses how to analyze syntactically irregular expressions in a s...
We studied the treebanks included in HamleDT and partially unified their label sets. Afterwards, we ...
A fundamental issue in annotation efforts is to ensure that the same phenomena within and across cor...
A fundamental issue in annotation efforts is to ensure that the same phenomena within and across cor...
We present HamleDT – a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of ex...
This paper describes a statistical approach to detect annotation errors in dependency treebanks. As ...
Abstract. We present HamleDT – a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compila...
International audienceIn this paper, we investigate various strategies to predict both syntactic dep...
International audienceIn this paper, we investigate various strategies to predict both syntactic dep...
Multiword expressions (MWEs) are quite frequent in languages such as English, but their diversity, t...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This thesis explores annotation of multiword expressions in the Prague Dependency Treebank 2.0. We e...
International audienceIn this paper, we investigate various strategies to predict both syntactic dep...
International audienceThis paper discusses how to analyze syntactically irregular expressions in a s...
We studied the treebanks included in HamleDT and partially unified their label sets. Afterwards, we ...
A fundamental issue in annotation efforts is to ensure that the same phenomena within and across cor...
A fundamental issue in annotation efforts is to ensure that the same phenomena within and across cor...
We present HamleDT – a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of ex...
This paper describes a statistical approach to detect annotation errors in dependency treebanks. As ...
Abstract. We present HamleDT – a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compila...
International audienceIn this paper, we investigate various strategies to predict both syntactic dep...
International audienceIn this paper, we investigate various strategies to predict both syntactic dep...
Multiword expressions (MWEs) are quite frequent in languages such as English, but their diversity, t...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...
This chapter aims at presenting different strategies that have been designed to incorporate multiwor...