Similarity search is the basis for many data analytics techniques, including k-nearest neighbor classification and outlier detection. Similarity search over large data sets relies on i) a distance metric learned from input examples and ii) an index to speed up search based on the learned distance metric. In interactive systems, input to guide the learning of the distance metric may be provided over time. As this new input changes the learned distance metric, a naive approach would adopt the costly process of re-indexing all items after each metric change. In this paper, we propose the first solution, called OASIS, to instantaneously adapt the index to conform to a changing distance metric without this prohibitive re-indexing process. To ach...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
All pairs similarity search is a problem where a set of data objects is given and the task is to fin...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...
Similarity search is a task fundamental to many machine learning and data analytics applications, wh...
Indexing methods have been widely used for fast data retrieval on large scale datasets. When the dat...
Similarity search is a very important operation in multimedia databases and other database applicati...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
Research Doctorate - Doctor of Philosophy (PhD)This thesis presents techniques for accelerating simi...
A novel access structure for similarity search in metric data, called Similarity Hashing (SH), is pr...
Recommender systems are an integral part of many web applica- tions. With increasingly larger user ...
Scalable similarity search on images, documents, and user activities benefits generic search, data v...
Similarity-based search has been a key factor for many applications such as multimedia retrieval, da...
Hashing is very useful for fast approximate similarity search on large database. In the unsupervised...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficien...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
All pairs similarity search is a problem where a set of data objects is given and the task is to fin...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...
Similarity search is a task fundamental to many machine learning and data analytics applications, wh...
Indexing methods have been widely used for fast data retrieval on large scale datasets. When the dat...
Similarity search is a very important operation in multimedia databases and other database applicati...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
Research Doctorate - Doctor of Philosophy (PhD)This thesis presents techniques for accelerating simi...
A novel access structure for similarity search in metric data, called Similarity Hashing (SH), is pr...
Recommender systems are an integral part of many web applica- tions. With increasingly larger user ...
Scalable similarity search on images, documents, and user activities benefits generic search, data v...
Similarity-based search has been a key factor for many applications such as multimedia retrieval, da...
Hashing is very useful for fast approximate similarity search on large database. In the unsupervised...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficien...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
All pairs similarity search is a problem where a set of data objects is given and the task is to fin...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...