Abstract. In the area of Text Retrieval, processing a query in the vector model has been verified to be qualitatively more effective than searching in the boolean model. However, in case of the classic vector model the current methods of processing many-term queries are inefficient, in case of LSI model there does not exist an efficient method for processing even the few-term queries. In this paper we propose a method of vector query processing based on metric indexing, which is efficient especially for the LSI model. In addition, we propose a concept of approximate semi-metric search, which can further improve the efficiency of retrieval process. Results of experiments made on moderate text collection are included.
We propose a text indexing technique for approximate pattern matching, which is practical and especi...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
This paper presents the basics of information retrieval: the vector space model for document represe...
Abstract. Text collections represented in LSI model are hard to search efficiently (i.e. quickly), s...
One of the most prominent trends of our time is the emergence of an information society. The amount ...
Metric indexing is a branch of search technology that is designed for search non-textual data. Examp...
In a document retieval, or other pattern matching environment where stored entities (documents) are ...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
With the ever increasing volumes of information generation, users of information systems are facing ...
In order to achieve large scalability, indexing structures are usually distributed to incorporate mo...
The heart of an information retrieval system is its retrieval model. The model is used to capture th...
Web Information Retrieval is another problem of searching elements of a set that are closest to a gi...
In this paper, we focus on indexing and searching in high-dimensional data. To achieve the target we...
The definition of good strategies for Text Retrieval has become in recent years more and more import...
ABSTRACT In the vector space model for information retrieval, term vectors are pair-wise orthogon...
We propose a text indexing technique for approximate pattern matching, which is practical and especi...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
This paper presents the basics of information retrieval: the vector space model for document represe...
Abstract. Text collections represented in LSI model are hard to search efficiently (i.e. quickly), s...
One of the most prominent trends of our time is the emergence of an information society. The amount ...
Metric indexing is a branch of search technology that is designed for search non-textual data. Examp...
In a document retieval, or other pattern matching environment where stored entities (documents) are ...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
With the ever increasing volumes of information generation, users of information systems are facing ...
In order to achieve large scalability, indexing structures are usually distributed to incorporate mo...
The heart of an information retrieval system is its retrieval model. The model is used to capture th...
Web Information Retrieval is another problem of searching elements of a set that are closest to a gi...
In this paper, we focus on indexing and searching in high-dimensional data. To achieve the target we...
The definition of good strategies for Text Retrieval has become in recent years more and more import...
ABSTRACT In the vector space model for information retrieval, term vectors are pair-wise orthogon...
We propose a text indexing technique for approximate pattern matching, which is practical and especi...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
This paper presents the basics of information retrieval: the vector space model for document represe...