This paper reports experiments on a corpus of news articles from the Financial Times, comparing different text similarity models. First, the Ferret system, which uses a method based solely on lexical similarities, is applied; then methods based on semantic similarities are investigated. Different feature string selection criteria are used, for instance with and without synonyms obtained from WordNet, or with noun phrases extracted for comparison. The results indicate that synonyms, rather than lexical strings, are important for finding similar texts. Hypernyms and noun phrases also contribute to the identification of text similarity, though they are not better than synonyms. However, precision is a problem for the semantic similarity methods because to...
Computing the semantic similarity between terms (or short text expressions) that have the same mean...
This paper presents a method for measuring the semantic similarity of texts using a corpus based mea...
Computing text similarity is a foundational technique for a wide range of tasks in natural language ...
Text similarity measurement compares text with available references to indicate the degree of simila...
Measuring semantic similarity between texts is calculating semantic relatedness between texts using ...
Abstract. Semantic relatedness refers to the degree to which two concepts or words are related. Huma...
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and ...
We present a comprehensive study of computing similarity between texts. We start from the observatio...
The massive amount of information from the internet has revolutionized the field of natural language...
Abstract: Similarities for textual data. The evaluation of similarities between textual entities (do...
With the large number of documents on the web, there is an increasing need to be able to retrieve the bes...
Word similarity is a semantic measure that evaluates the similarity of words. The goal of the master...