We introduce a new problem, identifying the type of relation that holds between a pair of similar items in a digital library. Being able to provide a reason why items are similar has applications in recommendation, personalization, and search. We investigate the problem within the context of Europeana, a large digital library containing items related to cultural heritage. A range of types of similarity in this collection were identified. A set of 1,500 pairs of items from the collection were annotated using crowdsourcing. A high intertagger agreement (average 71.5 Pearson correlation) was obtained and demonstrates that the task is well defined. We also present several approaches to automatically identifying the type of similarity. The best ...
There are at least two kinds of similarity. Relational similarity is correspondence between relation...
∗Signatures are on file in the Graduate School. With the rise of the computer age, various kinds of ...
The paper argues that automatic link generation and typing methods are needed to find and maintain cr...
The aim of this thesis is to examine one part of Amazon.coms recommender systems: Similar Items. ...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
164 p.The overarching goal of this thesis is to advance on computational models of meaning and their...
Document classification and provenance has become an important area of computer science as the amoun...
Semantic similarity measurement aims to determine the likeness between two text expressions that use...
Textbooks are even more available in electronic format nowadays than in the past. As the size of a...
In recent years, development of tools and methods for measuring document similarity has become a thr...
Search systems have for some time provided users with the ability to request documents similar to a ...
As digital libraries grow, they are prompting new consideration into same-work relationships. They p...
This is the data obtained from crowdsourcing tasks which ask workers to provide similarity metrics b...
We present a semantic similarity-based recommender service. Our experimental application and validat...
There are at least two kinds of similarity. Relational similarity is correspondence between relation...
∗Signatures are on file in the Graduate School. With the rise of the computer age, various kinds of ...
The paper argues that automatic link generation and typing methods are needed to find and maintain cr...
The aim of this thesis is to examine one part of Amazon.coms recommender systems: Similar Items. ...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
164 p.The overarching goal of this thesis is to advance on computational models of meaning and their...
Document classification and provenance has become an important area of computer science as the amoun...
Semantic similarity measurement aims to determine the likeness between two text expressions that use...
Textbooks are even more available in electronic format nowadays than in the past. As the size of a...
In recent years, development of tools and methods for measuring document similarity has become a thr...
Search systems have for some time provided users with the ability to request documents similar to a ...
As digital libraries grow, they are prompting new consideration into same-work relationships. They p...
This is the data obtained from crowdsourcing tasks which ask workers to provide similarity metrics b...
We present a semantic similarity-based recommender service. Our experimental application and validat...
There are at least two kinds of similarity. Relational similarity is correspondence between relation...
∗Signatures are on file in the Graduate School. With the rise of the computer age, various kinds of ...
The paper argues that automatic link generation and typing methods are needed to find and maintain cr...