In this paper we present a novel approach to automatic creation of anchor texts for hyper-links in a document pointing to similar doc-uments. Methods used in this approach rank parts of a document based on the similarity to a presumably related document. Ranks are then used to automatically construct the best anchor text for a link inside original document to the compared document. A number of dif-ferent methods from information retrieval and natural language processing are adapted for this task. Automatically constructed anchor texts are manually evaluated in terms of relat-edness to linked documents and compared to baseline consisting of originally inserted an-chor texts. Additionally we use crowdsourc-ing for evaluation of original ancho...
International audienceWikipedia, the largest open-collaborative online encyclopedia, is a corpus of ...
This paper presents and compares two methods for eval-uating the syntactic similarity between docume...
Textbooks are even more available in electronic format nowadays than in the past. As the size of a...
The unprecedented growth of the World Wide Web illustrates the importance of hypertext as a method f...
submitted for publication Abstract. Assessing semantic similarity between text documents is a crucia...
grantor: University of TorontoWe describe a novel method for automatically generating hype...
We present a method for automatic generation of in-text explanatory hyperlinks for use in web pub-li...
This paper investigates the use and the prediction potential of semantic similarity measures for aut...
Assessing semantic similarity between text documents is a crucial aspect in Information Retrieval sy...
Abstract: In this paper, a unified framework for clustering documents based on vocabulary overlap an...
Abstract. Finding pages on the web that are relevant to some user-defined criteria is a longestablis...
this paper we describe an approach we have developed to semi-automatically generate a hypertext from...
Abstract—The retrieval of similar documents from the Web using documents as input instead of key-ter...
AbstractWe propose a method for computing semantic relatedness between words or texts by using knowl...
Abstract — As the volume of information is in internet is increasing staggeringly therefore it is re...
International audienceWikipedia, the largest open-collaborative online encyclopedia, is a corpus of ...
This paper presents and compares two methods for eval-uating the syntactic similarity between docume...
Textbooks are even more available in electronic format nowadays than in the past. As the size of a...
The unprecedented growth of the World Wide Web illustrates the importance of hypertext as a method f...
submitted for publication Abstract. Assessing semantic similarity between text documents is a crucia...
grantor: University of TorontoWe describe a novel method for automatically generating hype...
We present a method for automatic generation of in-text explanatory hyperlinks for use in web pub-li...
This paper investigates the use and the prediction potential of semantic similarity measures for aut...
Assessing semantic similarity between text documents is a crucial aspect in Information Retrieval sy...
Abstract: In this paper, a unified framework for clustering documents based on vocabulary overlap an...
Abstract. Finding pages on the web that are relevant to some user-defined criteria is a longestablis...
this paper we describe an approach we have developed to semi-automatically generate a hypertext from...
Abstract—The retrieval of similar documents from the Web using documents as input instead of key-ter...
AbstractWe propose a method for computing semantic relatedness between words or texts by using knowl...
Abstract — As the volume of information is in internet is increasing staggeringly therefore it is re...
International audienceWikipedia, the largest open-collaborative online encyclopedia, is a corpus of ...
This paper presents and compares two methods for eval-uating the syntactic similarity between docume...
Textbooks are even more available in electronic format nowadays than in the past. As the size of a...