The concept of similarity is used as the basis for many data exploration and data mining tasks. Nearest Neighbor (NN) queries identify the most similar items, or in terms of distance the closest points to a query point. Similarity is traditionally characterized using a distance function between multi-dimensional feature vectors. However, when the data is high-dimensional, traditional distance functions fail to significantly distinguish between the closest and furthest points, as few dissimilar dimensions dominate the distance function. Localized similarity functions, i.e. functions that only consider dimensions close to the query, quantize each dimension independently and only compute similarity for the dimensions where the query and the po...
Nearest neighbor search in high dimensional spaces is an interesting and important problem which is ...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...
Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due...
As databases increasingly integrate different types of information such as time-series, multimedia a...
In this thesis, we study high dimensional approximate similarity search algorithms. High dimensional...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
Similarity search is important in information retrieval applications where objects are usually repre...
Similarity search in high-dimensional data spaces is a popular paradigm for many modern database app...
The recent years have witnessed the emerging of vector quantization (VQ) techniques for efficient si...
We have generalised a class of similarity measures that are designed to address the problems associa...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Abstract. Adaptable similarity queries based on quadratic form distance functions are widely popular...
Similarity search problems in high-dimensional data arise in many areas of computer science such as ...
The technological developments of the last twenty years are leading the world to a new era. The inve...
Similarity search in large multimedia databases requires ef- ficient query processing based on suita...
Nearest neighbor search in high dimensional spaces is an interesting and important problem which is ...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...
Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due...
As databases increasingly integrate different types of information such as time-series, multimedia a...
In this thesis, we study high dimensional approximate similarity search algorithms. High dimensional...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
Similarity search is important in information retrieval applications where objects are usually repre...
Similarity search in high-dimensional data spaces is a popular paradigm for many modern database app...
The recent years have witnessed the emerging of vector quantization (VQ) techniques for efficient si...
We have generalised a class of similarity measures that are designed to address the problems associa...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Abstract. Adaptable similarity queries based on quadratic form distance functions are widely popular...
Similarity search problems in high-dimensional data arise in many areas of computer science such as ...
The technological developments of the last twenty years are leading the world to a new era. The inve...
Similarity search in large multimedia databases requires ef- ficient query processing based on suita...
Nearest neighbor search in high dimensional spaces is an interesting and important problem which is ...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...
Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due...