Abstract—Nearest Neighbor Search for similar document retrieval suffers from an efficiency problem when scaled to a large dataset. In this paper, we introduce an unsupervised approach based on Locality Sensitive Hashing to alleviate its search complexity problem. The advantage of our proposed approach is that it does not need to scan all the documents for retrieving top-K Nearest Neighbors, instead, a number of hash table lookup operations are conducted to retrieve the top-K candidates. Experiments on two massive news and tweets datasets demonstrate that our approach is able to achieve over an order of speedup compared with the traditional Information Retrieval method and maintain reasonable precision
The problem of landmark recognition has achieved excellent results in small-scale datasets. Instead,...
A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficien...
Approximate Nearest Neighbor (ANN) search in high dimensional space has become a fundamental paradig...
This work fulfills sublinear time Near-est Neighbor Search (NNS) in massive-scale document collectio...
Abstract—Similarity search is critical for many database ap-plications, including the increasingly p...
Many modern applications of AI such as web search, mobile browsing, image processing, and natural la...
k-nearest neighbor (k-NN) search aims at nding k points nearest to a query point in a given datase...
International audienceIn this paper, we have presented a new and faster word retrieval approach, whi...
As databases increasingly integrate different types of information such as time-series, multimedia a...
Numerous applications in search, databases, machine learning, and computer vision, can benefit from...
Many modern applications of AI such as web search, mobile browsing, image processing, and natural la...
Nearest neighbor (NN) search in high dimensional space is an important problem in many applications....
Finding nearest neighbors has become an important operation on databases, with applications to text ...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
Nearest neighbor (NN) search in high dimensional space is an im-portant problem in many applications...
The problem of landmark recognition has achieved excellent results in small-scale datasets. Instead,...
A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficien...
Approximate Nearest Neighbor (ANN) search in high dimensional space has become a fundamental paradig...
This work fulfills sublinear time Near-est Neighbor Search (NNS) in massive-scale document collectio...
Abstract—Similarity search is critical for many database ap-plications, including the increasingly p...
Many modern applications of AI such as web search, mobile browsing, image processing, and natural la...
k-nearest neighbor (k-NN) search aims at nding k points nearest to a query point in a given datase...
International audienceIn this paper, we have presented a new and faster word retrieval approach, whi...
As databases increasingly integrate different types of information such as time-series, multimedia a...
Numerous applications in search, databases, machine learning, and computer vision, can benefit from...
Many modern applications of AI such as web search, mobile browsing, image processing, and natural la...
Nearest neighbor (NN) search in high dimensional space is an important problem in many applications....
Finding nearest neighbors has become an important operation on databases, with applications to text ...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
Nearest neighbor (NN) search in high dimensional space is an im-portant problem in many applications...
The problem of landmark recognition has achieved excellent results in small-scale datasets. Instead,...
A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficien...
Approximate Nearest Neighbor (ANN) search in high dimensional space has become a fundamental paradig...