This paper describes the development of a hybrid tool for a semi-automated process for validation of treebank annotation at various levels. The tool is developed for error detection at the part-of-speech, chunk and dependency levels of a Hindi treebank, currently under development. The tool aims to identify as many errors as possible at these levels to achieve consistency in the task of annotation. Consistency in treebank annotation is a must for making data as error-free as possible and for providing quality assurance. The tool is aimed at ensuring consistency and to make manual validation cost effective. We discuss a rule based and a hybrid approach (statistical methods combined with rule-based methods) by which a high-recall system can b...
In this paper, we present several ways to measure and evaluate the annotation and annotators, propos...
This study discusses evaluation methods for linguists to use when employing an automatically annotat...
Kashmiri is a resource poor language with very less computational and language resources available f...
A treebank is an important resource for developing many NLP based tools. Errors in the treebank may ...
This paper describes a statistical approach to detect annotation errors in dependency treebanks. As ...
The paper describes an approach to expedite the process of manual annotation of a Hindi dependency t...
Abstract. Treebanks play an important role in the development of var-ious natural language processin...
This paper discusses an automatic, data-driven approach to treebank error detection. The approach ad...
This work describes how derivation tree fragments based on a variant of Tree Adjoining Grammar (TAG)...
The paper describes an approach to automati-cally annotate a Hindi Treebank using Pan-inian dependen...
Thesis Abstract Akshay Aggarwal July 2020 This thesis attempts at correction of some errors and inco...
We studied the treebanks included in HamleDT and partially unified their label sets. Afterwards, we ...
In this paper, we propose a method for au-tomatic clause boundary annotation in the Hindi Dependency...
The article presents the system for annotation quality checking, proposed and used during the buildi...
In this paper, we propose a method for au-tomatic clause boundary annotation in the Hindi Dependency...
In this paper, we present several ways to measure and evaluate the annotation and annotators, propos...
This study discusses evaluation methods for linguists to use when employing an automatically annotat...
Kashmiri is a resource poor language with very less computational and language resources available f...
A treebank is an important resource for developing many NLP based tools. Errors in the treebank may ...
This paper describes a statistical approach to detect annotation errors in dependency treebanks. As ...
The paper describes an approach to expedite the process of manual annotation of a Hindi dependency t...
Abstract. Treebanks play an important role in the development of var-ious natural language processin...
This paper discusses an automatic, data-driven approach to treebank error detection. The approach ad...
This work describes how derivation tree fragments based on a variant of Tree Adjoining Grammar (TAG)...
The paper describes an approach to automati-cally annotate a Hindi Treebank using Pan-inian dependen...
Thesis Abstract Akshay Aggarwal July 2020 This thesis attempts at correction of some errors and inco...
We studied the treebanks included in HamleDT and partially unified their label sets. Afterwards, we ...
In this paper, we propose a method for au-tomatic clause boundary annotation in the Hindi Dependency...
The article presents the system for annotation quality checking, proposed and used during the buildi...
In this paper, we propose a method for au-tomatic clause boundary annotation in the Hindi Dependency...
In this paper, we present several ways to measure and evaluate the annotation and annotators, propos...
This study discusses evaluation methods for linguists to use when employing an automatically annotat...
Kashmiri is a resource poor language with very less computational and language resources available f...