Abstract—Keyword-based search in text-rich multi-dimensional datasets facilitates many novel applications and tools. In this paper, we consider objects that are tagged with keywords and are embedded in a vector space. For these datasets, we study queries that ask for the tightest groups of points satisfying a given set of keywords. We propose a novel method called ProMiSH (Projection and Multi Scale Hashing) that uses random projection and hash-based index structures, and achieves high scalability and speedup. We present an exact and an approximate version of the algorithm. Our empirical studies, both on real and synthetic datasets, show that ProMiSH has a speedup of more than four orders over state-of-the-art tree-based techniques. Our sca...
Similarity search in multimedia databases requires an effi-cient support of nearest-neighbor search ...
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, e...
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, e...
Keyword based search in content rich multi-dimensional datasets encourages numerous novel applicatio...
Adding text to databases opens up many different innovations and functionalities that can be made fe...
Unlike tree-like indexes adopted in existing works, our index is less conscious to have an upswing o...
Unlike tree-like indexes adopted in existing works, our index is less responsive to the rise of dime...
Keyword based pursuit in content rich multi-dimensional datasets encourages numerous novel applicati...
Unlike the tree indicators used in existing companies, our index is less receptive when it comes to ...
Unlike tree indicators used in current works, our index responds to less effectiveness in order to i...
Keyword search is the most popular technique for querying large tree-structured datasets, often of u...
To create a better and robust system of searching the images from a huge dataset, we started working...
Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due...
Similarity search in multimedia databases requires an efficient support of nearest-neighbor search o...
Unlike tree-like ratios adopted in actual whole shebang, our indicator is less active to the rise of...
Similarity search in multimedia databases requires an effi-cient support of nearest-neighbor search ...
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, e...
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, e...
Keyword based search in content rich multi-dimensional datasets encourages numerous novel applicatio...
Adding text to databases opens up many different innovations and functionalities that can be made fe...
Unlike tree-like indexes adopted in existing works, our index is less conscious to have an upswing o...
Unlike tree-like indexes adopted in existing works, our index is less responsive to the rise of dime...
Keyword based pursuit in content rich multi-dimensional datasets encourages numerous novel applicati...
Unlike the tree indicators used in existing companies, our index is less receptive when it comes to ...
Unlike tree indicators used in current works, our index responds to less effectiveness in order to i...
Keyword search is the most popular technique for querying large tree-structured datasets, often of u...
To create a better and robust system of searching the images from a huge dataset, we started working...
Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due...
Similarity search in multimedia databases requires an efficient support of nearest-neighbor search o...
Unlike tree-like ratios adopted in actual whole shebang, our indicator is less active to the rise of...
Similarity search in multimedia databases requires an effi-cient support of nearest-neighbor search ...
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, e...
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, e...