Deep processing of natural language requires large scale lexical resources that have sufficient coverage at a sufficient level of detail and accuracy (i.e. both recall and precision). Hand-crafted lexicons are extremely labour-intensive to create and maintain, and require continuous updating and extension to retain their level of usability. In this paper we present a technique for extending lexicons using similarity measures that can be extracted from corpora. The technique involves creating lexical entries for unknown words based on entries for words that are known and that are deemed to be distributionally similar. We demonstrate the usefulness of the approach by providing an extended lexicon for the LinGO system using similarity measures...
We investigate the creation of corpora from web-harvested data following a scalable approach that ha...
Bilingual dictionaries define word equivalents from one language to another, thus acting as an impor...
Word similarity is a semantic measure that evaluates the similarity of words. The goal of the master...
Deep processing of natural language requires large scale lexical resources that have sufficient cove...
This research addresses the problem of deriving semantic similarity between words of language using ...
Knowledge-poor corpus-based approaches to natural language processing are attractive in that they do...
A new metaphor of two-dimensional text for data-driven semantic modeling of natural language is prop...
Most text processing systems need to compare lexical units – words, entities, semantic concepts – wi...
This paper will focus on automatic methods for quantifying language similarity. This is achieved by ...
A new metaphor of two-dimensional text for data-driven semantic modeling of natural language is prop...
Semantic similarity between words is becoming a generic problem for many applications of computation...
In this article we present an approach to the automatic discovery of term similarities, which may se...
In many natural language understanding applications, text processing requires comparing lexical unit...
Describing, comparing and evaluating corpora are key issues in corpus-based translation and corpus l...
Abstract. Semantic relatedness refers to the degree to which two concepts or words are related. Huma...
We investigate the creation of corpora from web-harvested data following a scalable approach that ha...
Bilingual dictionaries define word equivalents from one language to another, thus acting as an impor...
Word similarity is a semantic measure that evaluates the similarity of words. The goal of the master...
Deep processing of natural language requires large scale lexical resources that have sufficient cove...
This research addresses the problem of deriving semantic similarity between words of language using ...
Knowledge-poor corpus-based approaches to natural language processing are attractive in that they do...
A new metaphor of two-dimensional text for data-driven semantic modeling of natural language is prop...
Most text processing systems need to compare lexical units – words, entities, semantic concepts – wi...
This paper will focus on automatic methods for quantifying language similarity. This is achieved by ...
A new metaphor of two-dimensional text for data-driven semantic modeling of natural language is prop...
Semantic similarity between words is becoming a generic problem for many applications of computation...
In this article we present an approach to the automatic discovery of term similarities, which may se...
In many natural language understanding applications, text processing requires comparing lexical unit...
Describing, comparing and evaluating corpora are key issues in corpus-based translation and corpus l...
Abstract. Semantic relatedness refers to the degree to which two concepts or words are related. Huma...
We investigate the creation of corpora from web-harvested data following a scalable approach that ha...
Bilingual dictionaries define word equivalents from one language to another, thus acting as an impor...
Word similarity is a semantic measure that evaluates the similarity of words. The goal of the master...