This paper discusses the problem of automated lexicography. In the corpus-based approach, a lexicographer has to manually group contexts of a target word into clusters in order to identify word senses. When a large number of the contexts is given, this process becomes a tedious and time-consuming task. To overcome this problem, we propose an efficient technique based on unsupervised clustering. We present the spherical Gaussian EM algorithm that can be enhanced by combining a robust initialization method based on Principal Component Analysis. The resulting clusters can provide a structure for analyzing the underlying senses of the target word found in a text corpus. Experimental results on two different data sets of polysemous words indicat...
This paper introduces a new technique of document clustering based on frequent senses. The proposed ...
This thesis describes a clustering approach to automatically inferring soft semantic classes and cha...
This thesis describes a clustering approach to automatically inferring soft semantic classes and cha...
This paper presents an unsupervised algorithm which automatically discovers word senses from text. T...
We describe the results of performing text mining on a challenging problem in natural language proce...
The unavailability of very large corpora with semantically disambiguated words is a major limitation...
Word Sense Induction is a task of automatically finding word senses from large scale texts. It is ge...
In this paper we show that an unsupervised method for ranking word senses automatically can be used ...
A system tbr lexical acquisition is presented where word meanings are represented by clusters of phr...
We will demonstrate the output of a distribu-tional clustering algorithm called Clustering by Commit...
Word sense induction (WSI) is a challenging problem in natural language processing that involves the...
The unavailability of very large corpora with se-mantically disambiguated words is a major limi-tati...
Word sense discrimination is the process of distinguishing the number of unique senses of a target w...
An important problem in Natural Language Processing is identifying the correct sense of a word in a ...
Abstract. A specific sense of a word can be determined by collocation of the words gathered from the...
This paper introduces a new technique of document clustering based on frequent senses. The proposed ...
This thesis describes a clustering approach to automatically inferring soft semantic classes and cha...
This thesis describes a clustering approach to automatically inferring soft semantic classes and cha...
This paper presents an unsupervised algorithm which automatically discovers word senses from text. T...
We describe the results of performing text mining on a challenging problem in natural language proce...
The unavailability of very large corpora with semantically disambiguated words is a major limitation...
Word Sense Induction is a task of automatically finding word senses from large scale texts. It is ge...
In this paper we show that an unsupervised method for ranking word senses automatically can be used ...
A system tbr lexical acquisition is presented where word meanings are represented by clusters of phr...
We will demonstrate the output of a distribu-tional clustering algorithm called Clustering by Commit...
Word sense induction (WSI) is a challenging problem in natural language processing that involves the...
The unavailability of very large corpora with se-mantically disambiguated words is a major limi-tati...
Word sense discrimination is the process of distinguishing the number of unique senses of a target w...
An important problem in Natural Language Processing is identifying the correct sense of a word in a ...
Abstract. A specific sense of a word can be determined by collocation of the words gathered from the...
This paper introduces a new technique of document clustering based on frequent senses. The proposed ...
This thesis describes a clustering approach to automatically inferring soft semantic classes and cha...
This thesis describes a clustering approach to automatically inferring soft semantic classes and cha...