The effects of dimensionality reduction on information retrieval system performance are studied using Latent Semantic Indexing (LSI) on simulated data. The author hypothesize that LSI improve retrieval by improving the fit between a linear language model and non-linear data such as natural language text. The study analyzes how the values of three variables bear on optimizing k, the system's representational dimensionality. The variables studied are: the correlation between terms, the dimensionality of the untransformed termspace, and the number of documents in the collection. Using multinormally-distributed, stochastic matrices as input, precision/recall and average search length (ASL) are computed for differently modeled retrieval situatio...
The task in text retrieval is to find the subset of a collection of documents relevant to a user's ...
In recent years, we have seen a tremendous growth in the volume of online text documents available o...
Information Retrieval is concerned with locating information (usually text) that is relevant to a us...
Latent Semantic Indexing (LSI) is commonly used to match queries to documents in information retriev...
Conventional vector based Information Retrieval (IR) models, Vector Space Model (VSM) and Generalize...
In this work we present a study of different techniques for semantic indexing by dimension reduction...
Abstract Experiments show that information retrieval and filtering can be much improved by Latent Se...
In recent years, we have seen a tremendous growth in the volume of text documents available on the I...
AbstractLatent semantic indexing (LSI) is an information retrieval technique based on the spectral a...
Information retrieval is much more challenging than traditional small document collection retrieval....
We describe an approach to information retrieval using Latent Semantic Indexing (LSI) that directly ...
We have previously described an extension of the vector retrieval method called "Latent Semanti...
In this paper we present a theoretical model for understanding the performance of Latent Semantic In...
The task of information retrieval is to extract relevant documents for a certain query from the coll...
Latent semantic indexing (LSI) is an information retrieval technique based on the spectral analysis ...
The task in text retrieval is to find the subset of a collection of documents relevant to a user's ...
In recent years, we have seen a tremendous growth in the volume of online text documents available o...
Information Retrieval is concerned with locating information (usually text) that is relevant to a us...
Latent Semantic Indexing (LSI) is commonly used to match queries to documents in information retriev...
Conventional vector based Information Retrieval (IR) models, Vector Space Model (VSM) and Generalize...
In this work we present a study of different techniques for semantic indexing by dimension reduction...
Abstract Experiments show that information retrieval and filtering can be much improved by Latent Se...
In recent years, we have seen a tremendous growth in the volume of text documents available on the I...
AbstractLatent semantic indexing (LSI) is an information retrieval technique based on the spectral a...
Information retrieval is much more challenging than traditional small document collection retrieval....
We describe an approach to information retrieval using Latent Semantic Indexing (LSI) that directly ...
We have previously described an extension of the vector retrieval method called "Latent Semanti...
In this paper we present a theoretical model for understanding the performance of Latent Semantic In...
The task of information retrieval is to extract relevant documents for a certain query from the coll...
Latent semantic indexing (LSI) is an information retrieval technique based on the spectral analysis ...
The task in text retrieval is to find the subset of a collection of documents relevant to a user's ...
In recent years, we have seen a tremendous growth in the volume of online text documents available o...
Information Retrieval is concerned with locating information (usually text) that is relevant to a us...