The dependency of word similarity in vector space models on the frequency of words has been noted in a few studies, but has received very little attention. We study the influence of word frequency in a set of 10 000 randomly selected word pairs for a number of different combinations of feature weighting schemes and similarity measures. We find that the similarity of word pairs for all methods, except for the one using singular value decomposition to reduce the dimensionality of the feature space, is determined to a large extent by the frequency of the words. In a binary classification task of pairs of synonyms and unrelated words we find that for all similarity measures the results can be improved when we correct for the frequency bias
In distributional semantics words are represented by aggregated context features. The similarity of ...
In this paper, we consider two applications of distributional similarity measures, probability estim...
Modelling semantic similarity plays a fundamental role in lexical semantic applications. A natural w...
The dependency of word similarity in vector space models on the frequency of words has been noted in...
The dependency of word similarity in vector space models on the frequency of words has been noted in...
Distributional semantics tries to characterize the meaning of words by the contexts in which they oc...
© Published under licence by IOP Publishing Ltd. In this study a similarity in changes of frequencie...
© Published under licence by IOP Publishing Ltd. In this study a similarity in changes of frequencie...
This paper aims to re-think the role of the word similarity task in distributional semantics researc...
This work investigates the variation in a word's distributionally nearest neighbours with respect to...
This work investigates the variation in a word's distributionally nearest neighbours with respe...
International audienceA computational model of the construction of word meaning through exposure to ...
This research exploits the English and Dutch CELEX lexical database to investigate the form similari...
This research exploits the English and Dutch CELEX lexical database to investigate the form similari...
This research exploits the English and Dutch CELEX lexical database to investigate the form similari...
In distributional semantics words are represented by aggregated context features. The similarity of ...
In this paper, we consider two applications of distributional similarity measures, probability estim...
Modelling semantic similarity plays a fundamental role in lexical semantic applications. A natural w...
The dependency of word similarity in vector space models on the frequency of words has been noted in...
The dependency of word similarity in vector space models on the frequency of words has been noted in...
Distributional semantics tries to characterize the meaning of words by the contexts in which they oc...
© Published under licence by IOP Publishing Ltd. In this study a similarity in changes of frequencie...
© Published under licence by IOP Publishing Ltd. In this study a similarity in changes of frequencie...
This paper aims to re-think the role of the word similarity task in distributional semantics researc...
This work investigates the variation in a word's distributionally nearest neighbours with respect to...
This work investigates the variation in a word's distributionally nearest neighbours with respe...
International audienceA computational model of the construction of word meaning through exposure to ...
This research exploits the English and Dutch CELEX lexical database to investigate the form similari...
This research exploits the English and Dutch CELEX lexical database to investigate the form similari...
This research exploits the English and Dutch CELEX lexical database to investigate the form similari...
In distributional semantics words are represented by aggregated context features. The similarity of ...
In this paper, we consider two applications of distributional similarity measures, probability estim...
Modelling semantic similarity plays a fundamental role in lexical semantic applications. A natural w...