This paper addresses the problem of Approximate Nearest Neighbor (ANN) search in pattern recognition where feature vectors in a database are encoded as compact codes in or- der to speed-up the similarity search in large-scale databases. Considering the ANN problem from an information-theoretic perspective, we interpret it as an encoding, which maps the original feature vectors to a less entropic sparse represen- tation while requiring them to be as informative as possi- ble. We then define the coding gain for ANN search using information-theoretic measures. We next show that the clas- sical approach to this problem, which consists of binarization of the projected vectors is sub-optimal. Instead, a properly designed ternary encoding achieves...
International audienceAssociative memories aim at matching an input noisy vector with a stored one. ...
This paper presents a novel compact coding ap-proach, composite quantization, for approximate neares...
Numerous applications demand that we manipulate large sets of very high-dimensional signals. A simpl...
In this paper, we propose a novel scheme for approximate nearest neighbor (ANN) retrieval based on d...
International audienceThis paper proposes a binarization scheme for vectors of high dimension based ...
We consider the problem of fast content identification in high-dimensional feature spaces where a su...
The technological developments of the last twenty years are leading the world to a new era. The inve...
Scalable similarity search on images, documents, and user activities benefits generic search, data v...
International audienceThis paper tackles the task of storing a large collection of vectors, such as ...
Increasing sizes of databases and data stores mean that the traditional tasks, such as locating a ne...
International audienceA new method is introduced that makes use of sparse image representations to s...
The quantization techniques have shown competitive performance in approximate nearest neighbor searc...
International audienceMany nearest neighbor search algorithms rely on encoding real vectors into bin...
Hash-based methods achieve fast similarity search by representing high-dimensional data with compact...
Sparse coding is a basic task in many fields including signal processing, neuroscience and machine l...
International audienceAssociative memories aim at matching an input noisy vector with a stored one. ...
This paper presents a novel compact coding ap-proach, composite quantization, for approximate neares...
Numerous applications demand that we manipulate large sets of very high-dimensional signals. A simpl...
In this paper, we propose a novel scheme for approximate nearest neighbor (ANN) retrieval based on d...
International audienceThis paper proposes a binarization scheme for vectors of high dimension based ...
We consider the problem of fast content identification in high-dimensional feature spaces where a su...
The technological developments of the last twenty years are leading the world to a new era. The inve...
Scalable similarity search on images, documents, and user activities benefits generic search, data v...
International audienceThis paper tackles the task of storing a large collection of vectors, such as ...
Increasing sizes of databases and data stores mean that the traditional tasks, such as locating a ne...
International audienceA new method is introduced that makes use of sparse image representations to s...
The quantization techniques have shown competitive performance in approximate nearest neighbor searc...
International audienceMany nearest neighbor search algorithms rely on encoding real vectors into bin...
Hash-based methods achieve fast similarity search by representing high-dimensional data with compact...
Sparse coding is a basic task in many fields including signal processing, neuroscience and machine l...
International audienceAssociative memories aim at matching an input noisy vector with a stored one. ...
This paper presents a novel compact coding ap-proach, composite quantization, for approximate neares...
Numerous applications demand that we manipulate large sets of very high-dimensional signals. A simpl...