Digital libraries are a core information technology. When the stored data is complex, e.g. high-resolution images or molecular protein structures, simple query types like the exact match query are hardly applicable. In such environments similarity queries, particularly range queries and k-nearest neighbor queries, turn out to be important query types. Numerous approaches have been proposed for the processing of similarity queries which mainly concentrate on highly dynamic data sets where insertion, update, and deletion operations permanently occur. However, only little effort has been devoted to the case of rather static data sets - a case that frequently occurs in digital libraries. In this paper, we introduce a novel technique for efficie...
The nearest neighbor algorithm is the most basic class of techniques in the sub-fields of machine le...
Similarity-based search has been a key factor for many applications such as multimedia retrieval, da...
To save memory and improve speed, vectorial data such as images and signals are often represented as...
Digital libraries are a core information technology. When the stored data is complex, e.g. high-reso...
Similarity search is important for many data-intensive applications to identify a set of similar obj...
A similarity query is to find from a collection of items those that are similar to a given query ite...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Scalable similarity search on images, documents, and user activities benefits generic search, data v...
Abstract—We consider the problem of finding similar patterns in a time sequence. Typical application...
Edit distance is the most widely used method to quantify similarity between two strings. We investig...
Document similarity has important real life applications such as finding duplicate web sites and ide...
Abstract. Similarity queries searching for the most similar objects in a database compared to a give...
Efficient and effective methods of making data accessible to its consumers - be they humans or algor...
As databases increasingly integrate different types of information such as time-series, multimedia a...
Similarity search is a crucial task in multimedia retrieval and data mining. Most existing work has ...
The nearest neighbor algorithm is the most basic class of techniques in the sub-fields of machine le...
Similarity-based search has been a key factor for many applications such as multimedia retrieval, da...
To save memory and improve speed, vectorial data such as images and signals are often represented as...
Digital libraries are a core information technology. When the stored data is complex, e.g. high-reso...
Similarity search is important for many data-intensive applications to identify a set of similar obj...
A similarity query is to find from a collection of items those that are similar to a given query ite...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Scalable similarity search on images, documents, and user activities benefits generic search, data v...
Abstract—We consider the problem of finding similar patterns in a time sequence. Typical application...
Edit distance is the most widely used method to quantify similarity between two strings. We investig...
Document similarity has important real life applications such as finding duplicate web sites and ide...
Abstract. Similarity queries searching for the most similar objects in a database compared to a give...
Efficient and effective methods of making data accessible to its consumers - be they humans or algor...
As databases increasingly integrate different types of information such as time-series, multimedia a...
Similarity search is a crucial task in multimedia retrieval and data mining. Most existing work has ...
The nearest neighbor algorithm is the most basic class of techniques in the sub-fields of machine le...
Similarity-based search has been a key factor for many applications such as multimedia retrieval, da...
To save memory and improve speed, vectorial data such as images and signals are often represented as...