Abstract. Text collections represented in LSI model are hard to search efficiently (i.e. quickly), since there exists no indexing method for the LSI matrices. The inverted file, often used in both boolean and classic vector model, cannot be effectively utilized, because query vectors in LSI model are dense. A possible way for efficient search in LSI matrices could be the usage of metric access methods (MAMs). Instead of cosine measure, the MAMs can utilize the deviation metric for query processing as an equivalent dissimilarity measure. However, the intrinsic dimensionality of collections represented by LSI matrices is often large, which decreases MAMs ’ performance in searching. In this paper we introduce σ-LSI, a modification of LSI in wh...
In metric search, worst-case analysis is of little value, as the search invariably degenerates to a ...
Searching in a dataset for elements that are similar to a given query element is a core problem in a...
Abstract. Metric access methods (MAMs) serve as a tool for speeding similarity queries. However, all...
Abstract. In the area of Text Retrieval, processing a query in the vector model has been verified to...
We describe an approach to information retrieval using Latent Semantic Indexing (LSI) that directly ...
The ever increasing amount of data and the growing diversity in data types requires effective and ef...
In this work we present a study of different techniques for semantic indexing by dimension reduction...
Latent semantic indexing (LSI) can be used in patent searching to overcome drawbacks of Boolean sear...
Latent Semantic Indexing (LSI) is one of the well-liked techniques in the information retrieval fiel...
International audienceSimilarity search in metric spaces refers to searching elements in data reposi...
In this paper, we focus on indexing and searching in high-dimensional data. To achieve the target we...
Metric indexing is a branch of search technology that is designed for search non-textual data. Examp...
Several methods exists for performing similarity searches quickly using metric indexing. However, mo...
Abstract—LSI usually is conducted by using the singular value decomposition (SVD). The main difficul...
LSI and related methods Given a set of k basis vectors (x1... xk) = Xk, express the data matrix A i...
In metric search, worst-case analysis is of little value, as the search invariably degenerates to a ...
Searching in a dataset for elements that are similar to a given query element is a core problem in a...
Abstract. Metric access methods (MAMs) serve as a tool for speeding similarity queries. However, all...
Abstract. In the area of Text Retrieval, processing a query in the vector model has been verified to...
We describe an approach to information retrieval using Latent Semantic Indexing (LSI) that directly ...
The ever increasing amount of data and the growing diversity in data types requires effective and ef...
In this work we present a study of different techniques for semantic indexing by dimension reduction...
Latent semantic indexing (LSI) can be used in patent searching to overcome drawbacks of Boolean sear...
Latent Semantic Indexing (LSI) is one of the well-liked techniques in the information retrieval fiel...
International audienceSimilarity search in metric spaces refers to searching elements in data reposi...
In this paper, we focus on indexing and searching in high-dimensional data. To achieve the target we...
Metric indexing is a branch of search technology that is designed for search non-textual data. Examp...
Several methods exists for performing similarity searches quickly using metric indexing. However, mo...
Abstract—LSI usually is conducted by using the singular value decomposition (SVD). The main difficul...
LSI and related methods Given a set of k basis vectors (x1... xk) = Xk, express the data matrix A i...
In metric search, worst-case analysis is of little value, as the search invariably degenerates to a ...
Searching in a dataset for elements that are similar to a given query element is a core problem in a...
Abstract. Metric access methods (MAMs) serve as a tool for speeding similarity queries. However, all...