International audienceIn this paper, we propose a model for semantic clustering of entities extracted from a text, and we apply it to a Proper Noun classification task. This model is based on a new method to compute the similarity between the entities. In- deed, the classical way of calculating similarity is to build a feature vector or Bag-of-Features for each entity and then use classical similarity functions like cosine. In practice, the fea- tures are contextual ones, such as words around the different occurrences of each entity. Here, we propose to use an alternative representation for en- tities, called Bag-Of-Vectors, or Bag-of-Bags-of-Features. In this new model, each entity is not defined as a unique vector but as a set of vectors,...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
This thesis deals with semantic similarity of words. It describes and compares existing models that ...
This paper describes the use of clustering at three stages within a larger research effort to identi...
International audienceIn this paper, we propose a model for semantic clustering of entities extracte...
International audienceIn this paper, we propose a model for semantic clustering of entities extracte...
National audienceComputing distances between textual representation is at the heart of many Natural ...
National audienceComputing distances between textual representation is at the heart of many Natural ...
National audienceComputing distances between textual representation is at the heart of many Natural ...
Similarity analysis is a substantial issue in both corpus-based researches and language usages. This...
Similarity analysis is a substantial issue in both corpus-based researches and language usages. This...
In this paper, an unsupervised semantic class induction algorithm is proposed that is based on the p...
The basic Bag of Words (BOW) representation generally used in text documents clustering or categoriz...
Two document representation methods are mainly used in solving text mining problems. Known for its i...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
International audienceThis paper aims to analyze and adopt the term clustering method for building a...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
This thesis deals with semantic similarity of words. It describes and compares existing models that ...
This paper describes the use of clustering at three stages within a larger research effort to identi...
International audienceIn this paper, we propose a model for semantic clustering of entities extracte...
International audienceIn this paper, we propose a model for semantic clustering of entities extracte...
National audienceComputing distances between textual representation is at the heart of many Natural ...
National audienceComputing distances between textual representation is at the heart of many Natural ...
National audienceComputing distances between textual representation is at the heart of many Natural ...
Similarity analysis is a substantial issue in both corpus-based researches and language usages. This...
Similarity analysis is a substantial issue in both corpus-based researches and language usages. This...
In this paper, an unsupervised semantic class induction algorithm is proposed that is based on the p...
The basic Bag of Words (BOW) representation generally used in text documents clustering or categoriz...
Two document representation methods are mainly used in solving text mining problems. Known for its i...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
International audienceThis paper aims to analyze and adopt the term clustering method for building a...
Terms are linguistic signifiers of domain–specific concepts. Semantic similarity between terms refer...
This thesis deals with semantic similarity of words. It describes and compares existing models that ...
This paper describes the use of clustering at three stages within a larger research effort to identi...