Abstract—There is growing interest in representing image data and feature descriptors using compact binary codes for fast near neighbor search. Although binary codes are motivated by their use as direct indices (addresses) into a hash table, codes longer than 32 bits are not used as such, because doing so was thought to be ineffective. We introduce a rigorous way to build multiple hash tables on binary code substrings that enables exact k-nearest neighbor search in Hamming space. The approach is storage efficient and straightforward to implement. Theoretical analysis shows that the algorithm exhibits sub-linear run-time behavior for uniformly distributed codes. Empirical results show dramatic speedups over a linear scan baseline for datasets of...
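The idea of indexing binary code substrings can be illustrated with a minimal sketch. It is not the authors' implementation; it assumes only the pigeonhole argument that two codes differing in at most r bits must agree to within floor(r/m) bits on at least one of m disjoint substrings. All names here (`split`, `build_tables`, `ball`, `r_neighbors`, `knn`) are hypothetical, codes are stored as Python integers, and the code length is assumed to be divisible by m.

```python
from collections import defaultdict
from itertools import combinations


def split(code, bits, m):
    """Split a `bits`-bit integer into m disjoint substrings (as integers)."""
    s = bits // m  # assumption: bits is divisible by m
    mask = (1 << s) - 1
    return [(code >> (i * s)) & mask for i in range(m)]


def build_tables(codes, bits, m):
    """Build one hash table per substring position: substring -> code indices."""
    tables = [defaultdict(list) for _ in range(m)]
    for idx, code in enumerate(codes):
        for table, sub in zip(tables, split(code, bits, m)):
            table[sub].append(idx)
    return tables


def ball(sub, s, radius):
    """Yield every s-bit value within Hamming distance `radius` of `sub`."""
    for d in range(radius + 1):
        for positions in combinations(range(s), d):
            flipped = sub
            for p in positions:
                flipped ^= 1 << p
            yield flipped


def r_neighbors(query, codes, tables, bits, r):
    """Return (distance, index) for every code within Hamming distance r."""
    m = len(tables)
    s = bits // m
    candidates = set()
    for table, sub in zip(tables, split(query, bits, m)):
        # Pigeonhole: a true r-neighbor matches within r // m on some substring.
        for probe in ball(sub, s, r // m):
            candidates.update(table.get(probe, ()))
    found = []
    for i in candidates:
        dist = bin(codes[i] ^ query).count("1")  # full Hamming distance check
        if dist <= r:
            found.append((dist, i))
    return found


def knn(query, codes, tables, bits, k):
    """Exact k-nearest neighbors: grow the radius until k codes are found."""
    found = []
    for r in range(bits + 1):
        found = r_neighbors(query, codes, tables, bits, r)
        if len(found) >= k:
            break
    return sorted(found)[:k]
```

For example, with 16-bit codes and m = 4, `tables = build_tables(codes, 16, 4)` followed by `knn(q, codes, tables, 16, 5)` should return the same five (distance, index) pairs as a sorted linear scan. The sketch probes every table at the same radius and restarts the search at each radius for simplicity; a practical implementation would likely tighten the per-table radii and reuse candidates across radii.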
Similarity preserving hashing can aid forensic investigations by providing means to recognize known ...
Hashing is used to learn binary-code representation for data with expectation of preserving the neig...
The space-partitioning based hashing techniques are widely used to represent high-dimensional data p...
There is growing interest in representing image data and feature descriptors using compact binary co...
There has been growing interest in mapping image data onto compact binary codes for fast near neighb...
This paper addresses the problem of ultra-large-scale search in Hamming spaces. There has been consi...
Numerous applications in search, databases, machine learning, and computer vision can benefit from...
Handling large amounts of data, such as large image databases, requires the us...
Compact locally aggregated binary features have shown great advantages in image search. As the exhau...
Motivated by scalable partial-duplicate visual search, there has been growing interest in a wealth o...
Binary hashing has been widely used for efficient similarity search due to its query and storage ef...
Hash-based methods achieve fast similarity search by representing high-dimensional data with compact...
Binary hashing has been widely used for efficient similarity search due to its query and storage ef...
Algorithms to rapidly search massive image or video collections are critical for many visi...
We consider the problem of indexing a text T (of length n) with a light data structure that supports...