Abstract. Fast elimination of duplicate data is needed in many areas, especially in the textual data context. A solution to this problem was recently found for geometrical data using a hash function to speed up the process. The usage of the hash function is extremely efficient when incremental elimination is required especially for processing large data sets. In this paper a new construction of the hash function is presented, giving short clusters with few collisions only. The proposed hash function is not a perfect hash function nevertheless it gives similar properties to it. The hash function used takes advantage of the relatively large amount of available memory on modern computers, and works well with large data sets. Experiments have p...
Techniques based on hashing are heavily used in many applications, e.g. information retrieval, geom...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Abstract. Approximate near neighbor search plays a critical role in various kinds of multimedia appl...
Fast elimination of duplicate data is needed in many areas, especially in the textual data context....
AbstractIt is generally assumed that hashing is essential to solve many language processing problems...
It is generally assumed that hashing is essential to solve many language processing problems efficie...
Abstract—Many techniques for text processing are based on efficient data storing and retrieval techn...
Abstract. We present a new analysis of the well-known family of multiplicative hash functions, and i...
AbstractNew methods for computing perfect hash functions and applications of such functions to the p...
This thesis is centered around one of the most basic information retrieval problems, namely that of ...
It is generally assumed that hashing is essential to solve many language process-ing problems effici...
This paper proposes new hash functions for indexing local image descriptors. These functions are fir...
This paper deals with the construction of digital lexicons within the scope of Natural Language Proc...
A heuristic is given for finding minimal perfect hash functions without extensive searching. The pr...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Techniques based on hashing are heavily used in many applications, e.g. information retrieval, geom...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Abstract. Approximate near neighbor search plays a critical role in various kinds of multimedia appl...
Fast elimination of duplicate data is needed in many areas, especially in the textual data context....
AbstractIt is generally assumed that hashing is essential to solve many language processing problems...
It is generally assumed that hashing is essential to solve many language processing problems efficie...
Abstract—Many techniques for text processing are based on efficient data storing and retrieval techn...
Abstract. We present a new analysis of the well-known family of multiplicative hash functions, and i...
AbstractNew methods for computing perfect hash functions and applications of such functions to the p...
This thesis is centered around one of the most basic information retrieval problems, namely that of ...
It is generally assumed that hashing is essential to solve many language process-ing problems effici...
This paper proposes new hash functions for indexing local image descriptors. These functions are fir...
This paper deals with the construction of digital lexicons within the scope of Natural Language Proc...
A heuristic is given for finding minimal perfect hash functions without extensive searching. The pr...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Techniques based on hashing are heavily used in many applications, e.g. information retrieval, geom...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Abstract. Approximate near neighbor search plays a critical role in various kinds of multimedia appl...