International audienceStatistical and neural network based methods that compute their results by comparing a given text to be analyzed with a reference corpus assume that the reference corpus is complete and reliable enough. In this article, I conduct several experiments to verify this assumption and I suggest ways to improve these reference corpora by using carefully handcrafted linguistic resources
This paper aims to carve out a place for corpus research within theoretical linguistics and psycholi...
This paper aims to carve out a place for corpus research within theoretical linguistics and psycholi...
The increasing importance of language documentation as a paradigm in linguistic research means that ...
International audienceStatistical and neural network based methods that compute their results by com...
Abstract Over the past several decades, research and development of human language technology has be...
Linguistic resources, such as corpora, thesauruses, and (machine readable) dic-tionaries, are import...
The purpose of this dissertation is to propose a reliability metric and respective validation tools ...
The purpose of this dissertation is to propose a reliability metric and respective validation tools ...
Linguistically annotated corpora that are stored in standardized digital form can be a valuable sour...
We describe a recently developed corpus annotation scheme for evaluating parsers that avoids shortco...
Linguistics has drawn on the large quantities of authentic data contained in language corpora for se...
The evaluation of grammar inference systems is clearly a non-trivial task, as it is possible to have...
The evaluation of grammar inference systems is clearly a non-trivial task, as it is possible to have...
Annotated corpora are a fundamental resource for research and development in the field of natural la...
Suitable for graduate students interested in doing theoretical or applied (computational, ELT, etc.)...
This paper aims to carve out a place for corpus research within theoretical linguistics and psycholi...
This paper aims to carve out a place for corpus research within theoretical linguistics and psycholi...
The increasing importance of language documentation as a paradigm in linguistic research means that ...
International audienceStatistical and neural network based methods that compute their results by com...
Abstract Over the past several decades, research and development of human language technology has be...
Linguistic resources, such as corpora, thesauruses, and (machine readable) dic-tionaries, are import...
The purpose of this dissertation is to propose a reliability metric and respective validation tools ...
The purpose of this dissertation is to propose a reliability metric and respective validation tools ...
Linguistically annotated corpora that are stored in standardized digital form can be a valuable sour...
We describe a recently developed corpus annotation scheme for evaluating parsers that avoids shortco...
Linguistics has drawn on the large quantities of authentic data contained in language corpora for se...
The evaluation of grammar inference systems is clearly a non-trivial task, as it is possible to have...
The evaluation of grammar inference systems is clearly a non-trivial task, as it is possible to have...
Annotated corpora are a fundamental resource for research and development in the field of natural la...
Suitable for graduate students interested in doing theoretical or applied (computational, ELT, etc.)...
This paper aims to carve out a place for corpus research within theoretical linguistics and psycholi...
This paper aims to carve out a place for corpus research within theoretical linguistics and psycholi...
The increasing importance of language documentation as a paradigm in linguistic research means that ...