In this paper we show how a single framework for computational modeling of linguistic similarity can be used for solving many problems. Similarity can be measured within or across languages and at various linguistic levels. We model linguistic similarity in three stages: surface similarity, contextual similarity and distributional similarity. We have successfully used the framework for several applications like spell checking and text normalization, unsupervised shallow morphological analysis, improving information retrieval, transliteration, cognate identification, language identification and sentence alignment etc. For all these applications, we have been able to obtain results comparable with the state of the art.
Despite being one of the most popular tasks in lexical semantics, word similarity has often been lim...
Researchers on bilingual processing can benefit from computational tools developed in artificial int...
Early work in the computational treatment of natural language focused on summariza-tion, and machine...
ABSTRACT Syntactic similarity is an important activity in the area of high field of text documen...
This paper will focus on automatic methods for quantifying language similarity. This is achieved by ...
This research addresses the problem of deriving semantic similarity between words of language using ...
This paper addresses the problems of mea-suring similarity between languages— where the term languag...
In many natural language understanding applications, text processing requires comparing lexical unit...
A neural language model trained on a text corpus can be used to induce distributed representations o...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
We present a comprehensive study of computing similarity between texts. We start from the observatio...
Most text processing systems need to compare lexical units – words, entities, semantic concepts – wi...
As an initial effort to identify universal and language-specific factors that influence the behavior...
A neural language model trained on a text corpus can be used to induce distributed representations o...
Despite being one of the most popular tasks in lexical semantics, word similar-ity has often been li...
Despite being one of the most popular tasks in lexical semantics, word similarity has often been lim...
Researchers on bilingual processing can benefit from computational tools developed in artificial int...
Early work in the computational treatment of natural language focused on summariza-tion, and machine...
ABSTRACT Syntactic similarity is an important activity in the area of high field of text documen...
This paper will focus on automatic methods for quantifying language similarity. This is achieved by ...
This research addresses the problem of deriving semantic similarity between words of language using ...
This paper addresses the problems of mea-suring similarity between languages— where the term languag...
In many natural language understanding applications, text processing requires comparing lexical unit...
A neural language model trained on a text corpus can be used to induce distributed representations o...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
We present a comprehensive study of computing similarity between texts. We start from the observatio...
Most text processing systems need to compare lexical units – words, entities, semantic concepts – wi...
As an initial effort to identify universal and language-specific factors that influence the behavior...
A neural language model trained on a text corpus can be used to induce distributed representations o...
Despite being one of the most popular tasks in lexical semantics, word similar-ity has often been li...
Despite being one of the most popular tasks in lexical semantics, word similarity has often been lim...
Researchers on bilingual processing can benefit from computational tools developed in artificial int...
Early work in the computational treatment of natural language focused on summariza-tion, and machine...