Word sense induction (WSI) seeks to automat-ically discover the senses of a word in a cor-pus via unsupervised methods. We propose a sense-topic model for WSI, which treats sense and topic as two separate latent vari-ables to be inferred jointly. Topics are in-formed by the entire document, while senses are informed by the local context surrounding the ambiguous word. We also discuss unsu-pervised ways of enriching the original cor-pus in order to improve model performance, including using neural word embeddings and external corpora to expand the context of each data instance. We demonstrate significant im-provements over the previous state-of-the-art, achieving the best results reported to date on the SemEval-2013 WSI task.
An important problem in Natural Language Processing is identifying the correct sense of a word in a ...
Contextualized word embeddings have been employed effectively across several tasks in Natural Langua...
In recent years, there has been an increasing interest in learning a distributed representation of w...
In this paper, we present a novel approach to Word Sense Induction which is based on topic modeling....
Word sense induction (WSI), or the task of automatically discovering multiple senses or meanings of ...
Word sense induction (WSI) is the problem of automatically building an inventory of senses for a set...
Unsupervised word sense disambiguation (WSD) methods are an attractive approach to all-words WSD due...
We describe our language-independent un-supervised word sense induction system. This system only use...
Word Sense Disambiguation (WSD) and Word Sense Induction (WSI) are two fundamental tasks in Natural ...
Word sense induction (WSI) is a challenging problem in natural language processing that involves the...
The current trend in NLP is the use of highly opaque models, e.g. neural networks and word embedding...
This paper presents an unsupervised algorithm which automatically discovers word senses from text. T...
International audienceIn this paper, we present a unified model for the automatic induction of word ...
The use of word senses in place of surface word forms has been shown to improve performance on many ...
© 2016 Andrew BennettThere has recently been significant interest in unsupervised methods for learni...
An important problem in Natural Language Processing is identifying the correct sense of a word in a ...
Contextualized word embeddings have been employed effectively across several tasks in Natural Langua...
In recent years, there has been an increasing interest in learning a distributed representation of w...
In this paper, we present a novel approach to Word Sense Induction which is based on topic modeling....
Word sense induction (WSI), or the task of automatically discovering multiple senses or meanings of ...
Word sense induction (WSI) is the problem of automatically building an inventory of senses for a set...
Unsupervised word sense disambiguation (WSD) methods are an attractive approach to all-words WSD due...
We describe our language-independent un-supervised word sense induction system. This system only use...
Word Sense Disambiguation (WSD) and Word Sense Induction (WSI) are two fundamental tasks in Natural ...
Word sense induction (WSI) is a challenging problem in natural language processing that involves the...
The current trend in NLP is the use of highly opaque models, e.g. neural networks and word embedding...
This paper presents an unsupervised algorithm which automatically discovers word senses from text. T...
International audienceIn this paper, we present a unified model for the automatic induction of word ...
The use of word senses in place of surface word forms has been shown to improve performance on many ...
© 2016 Andrew BennettThere has recently been significant interest in unsupervised methods for learni...
An important problem in Natural Language Processing is identifying the correct sense of a word in a ...
Contextualized word embeddings have been employed effectively across several tasks in Natural Langua...
In recent years, there has been an increasing interest in learning a distributed representation of w...