In this vision paper, we propose a shift in perspective for improving the effectiveness of similarity search. Rather than focusing solely on enhancing the data quality, particularly machine learning-generated embeddings, we advocate for a more comprehensive approach that also enhances the underpinning search mechanisms. We highlight three novel avenues that call for a redefinition of the similarity search problem: exploiting implicit data structures and distributions, engaging users in an iterative feedback loop, and moving beyond a single query vector. These novel pathways have gained relevance in emerging applications such as large-scale language models, video clip retrieval, and data labeling. We discuss the corresponding research challe...
Similarity search is one of the most studied research fields in data mining. Given a query data poin...
International audienceWe consider the image retrieval problem of finding the images in a dataset tha...
Qin D., Chen Y., Guillaumin M., Van Gool L., ''Learning to rank bag-of-word histograms for large-sca...
Machine learning is the embodiment of an unapologetically data-driven philosophy that has increasing...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Similarity searching has become more and more popular, which was stimulated by the growth of diverse...
A similarity cache can reply to a query for an object with similar objects stored locally. In some a...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...
This paper introduces BoostMap, a method that can significantly reduce retrieval time in image and v...
University of Minnesota Ph.D. dissertation. January 2013. Major: Computer science. Advisor: Nikolaos...
Finding similar entities over data graphs is an important problem with many applications. Current si...
Similarity search is a key operation in multimedia retrieval systems and recommender systems, and it...
none2This volume contains 2 invited papers, 11 regular papers, 2 posters and 6 demos, presented at S...
SimRank is an influential link-based similarity measure that has been used in many fields of Web sea...
International audienceWe study an indexing architecture to store and search in a database of high-di...
Similarity search is one of the most studied research fields in data mining. Given a query data poin...
International audienceWe consider the image retrieval problem of finding the images in a dataset tha...
Qin D., Chen Y., Guillaumin M., Van Gool L., ''Learning to rank bag-of-word histograms for large-sca...
Machine learning is the embodiment of an unapologetically data-driven philosophy that has increasing...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Similarity searching has become more and more popular, which was stimulated by the growth of diverse...
A similarity cache can reply to a query for an object with similar objects stored locally. In some a...
Similarity search based on a distance function in metric spaces is a fundamental problem for many ap...
This paper introduces BoostMap, a method that can significantly reduce retrieval time in image and v...
University of Minnesota Ph.D. dissertation. January 2013. Major: Computer science. Advisor: Nikolaos...
Finding similar entities over data graphs is an important problem with many applications. Current si...
Similarity search is a key operation in multimedia retrieval systems and recommender systems, and it...
none2This volume contains 2 invited papers, 11 regular papers, 2 posters and 6 demos, presented at S...
SimRank is an influential link-based similarity measure that has been used in many fields of Web sea...
International audienceWe study an indexing architecture to store and search in a database of high-di...
Similarity search is one of the most studied research fields in data mining. Given a query data poin...
International audienceWe consider the image retrieval problem of finding the images in a dataset tha...
Qin D., Chen Y., Guillaumin M., Van Gool L., ''Learning to rank bag-of-word histograms for large-sca...