This paper proposes a novel statistical approach to intelligent document re-trieval. It seeks to offer a more structured and extensible mathematical approach to the term generalization done in the popular Latent Semantic Analysis (LSA) approach to document indexing. A Markov Random Field (MRF) is presented that captures relationships between terms and docu-ments as probabilistic dependence assumptions between random variables. From there, it uses the MRF-Gibbs equivalence to derive joint probabilities as well as local probabilities for document variables. A parameter learning method is proposed that utilizes rank reduction with singular value decom-position in a matter similar to LSA to reduce dimensionality of document-term relationships t...
Topic models have shown to be one of the most effective tools in Content-Based Multimedia Retrieval...
Latent semantic indexing (LSI) is an information retrieval technique based on the spectral analysis ...
Document fields, such as the title or the headings of a document, offer a way to consider the struct...
Document fields, such as the title or the headings of a document, offer a way to consider the struct...
A Markovian Random Field approach is proposed for Automatic Information Retrieval in full text docum...
We present a Markovian Random Field modeling for thematic knowledge extraction in text. An analogy i...
Current state of the art information retrieval models treat documents and queries as bags of words. ...
We describe a method for probabilistic document indexing using relevance feedback data that has been...
. We present a Markovian Random Field modeling for thematic knowledge extraction in text. An analogy...
In recent years probabilistic topic models have gained tremendous attention in data mining and natur...
Abstract Experiments show that information retrieval and filtering can be much improved by Latent Se...
C1 - Journal Articles RefereedProbabilistic latent semantic analysis (PLSA) is a method for computin...
Probabilistic latent semantic analysis (PLSA) is a method for computing term and document relationsh...
When a user types in a search query in an Information Retrieval system, a list of top ‘n’ ranked doc...
When a user types in a search query in an Information Retrieval system, a list of top ‘n’ ranked doc...
Topic models have shown to be one of the most effective tools in Content-Based Multimedia Retrieval...
Latent semantic indexing (LSI) is an information retrieval technique based on the spectral analysis ...
Document fields, such as the title or the headings of a document, offer a way to consider the struct...
Document fields, such as the title or the headings of a document, offer a way to consider the struct...
A Markovian Random Field approach is proposed for Automatic Information Retrieval in full text docum...
We present a Markovian Random Field modeling for thematic knowledge extraction in text. An analogy i...
Current state of the art information retrieval models treat documents and queries as bags of words. ...
We describe a method for probabilistic document indexing using relevance feedback data that has been...
. We present a Markovian Random Field modeling for thematic knowledge extraction in text. An analogy...
In recent years probabilistic topic models have gained tremendous attention in data mining and natur...
Abstract Experiments show that information retrieval and filtering can be much improved by Latent Se...
C1 - Journal Articles RefereedProbabilistic latent semantic analysis (PLSA) is a method for computin...
Probabilistic latent semantic analysis (PLSA) is a method for computing term and document relationsh...
When a user types in a search query in an Information Retrieval system, a list of top ‘n’ ranked doc...
When a user types in a search query in an Information Retrieval system, a list of top ‘n’ ranked doc...
Topic models have shown to be one of the most effective tools in Content-Based Multimedia Retrieval...
Latent semantic indexing (LSI) is an information retrieval technique based on the spectral analysis ...
Document fields, such as the title or the headings of a document, offer a way to consider the struct...