This paper presents the algorithms which power Google Cor-relate[8], a tool which finds web search terms whose popu-larity over time best matches a user-provided time series. Correlate was developed to generalize the query-based mod-eling techniques pioneered by Google Flu Trends and make them available to end users. Correlate searches across millions of candidate query time series to find the best matches, returning results in less than 200 milliseconds. Its feature set and requirements present unique challenges for Approximate Nearest Neighbor (ANN) search techniques. In this paper, we present Asymmetric Hashing (AH), the technique used by Correlate, and show how it can be adapted to fit the specific needs of the product. We then develop ...
In situations where there is interest in finding a collection of past records that are similar (but ...
The semantic meaning of a content is frequently represented by content vectors in which each dimensi...
Approximate Nearest Neighbor (ANN) search in high dimensional space has become a fundamental paradig...
Approximate Nearest neighbor search (ANNS) is fundamental and essential operation in applications f...
The Approximate Nearest Neighbor (ANN) search problem is important in applications such as informati...
Recently, hashing based Approximate Nearest Neighbor (ANN) techniques have been attracting lots of a...
Many modern applications of AI such as web search, mobile browsing, image processing, and natural la...
We investigate the idea of finding semantically related search engine queries based on their tempora...
Hashing has been widely used for large-scale approximate nearest neighbor search because of its stor...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
Numerous applications in search, databases, machine learning, and computer vision, can benefit from...
Computing the semantic similarity between terms (or short text expressions) that have the same mean...
As databases increasingly integrate different types of information such as time-series, multimedia a...
The long-standing problem of efficient nearest-neighbor (NN) search has ubiqui-tous applications ran...
The rapid expansion of the web is causing the constant growth of information, leading to several pro...
In situations where there is interest in finding a collection of past records that are similar (but ...
The semantic meaning of a content is frequently represented by content vectors in which each dimensi...
Approximate Nearest Neighbor (ANN) search in high dimensional space has become a fundamental paradig...
Approximate Nearest neighbor search (ANNS) is fundamental and essential operation in applications f...
The Approximate Nearest Neighbor (ANN) search problem is important in applications such as informati...
Recently, hashing based Approximate Nearest Neighbor (ANN) techniques have been attracting lots of a...
Many modern applications of AI such as web search, mobile browsing, image processing, and natural la...
We investigate the idea of finding semantically related search engine queries based on their tempora...
Hashing has been widely used for large-scale approximate nearest neighbor search because of its stor...
The nearest- or near-neighbor query problems arise in a large variety of database applications, usua...
Numerous applications in search, databases, machine learning, and computer vision, can benefit from...
Computing the semantic similarity between terms (or short text expressions) that have the same mean...
As databases increasingly integrate different types of information such as time-series, multimedia a...
The long-standing problem of efficient nearest-neighbor (NN) search has ubiqui-tous applications ran...
The rapid expansion of the web is causing the constant growth of information, leading to several pro...
In situations where there is interest in finding a collection of past records that are similar (but ...
The semantic meaning of a content is frequently represented by content vectors in which each dimensi...
Approximate Nearest Neighbor (ANN) search in high dimensional space has become a fundamental paradig...