In the first part of this thesis, we present a new understanding of the latent semantic space of a dataset from the dual perspective, which relaxes the above assumed conditions and leads naturally to a unified kernel function for a class of vector space models. New semantic analysis methods based on the unified kernel function are developed, which combine the advantages of LSI and GVSM. We also show that the new methods possess the stable property on the rank choice, i.e., even if the selected rank is quite far away from the optimal one, the retrieval performance will not degrade much. The experimental results of our methods on the standard test sets are promising.In the second part of this thesis, we propose that the mathematical structure...
Vector Space Model (VSM) has been at the core of information retrieval for the past decades. VSM con...
Latent semantic indexing (LSI) can be used in patent searching to overcome drawbacks of Boolean sear...
Abstract. This paper presents a robust method for the construction of collection-specic document mod...
The Basic Vector Space Model (BVSM) is well known in information retrieval. Unfortunately, its retri...
The method of latent semantic indexing (LSI) is well-known for tackling the synonymy and polysemy pr...
Latent semantic analysis (LSA)is based on the concept of vector space mod-els, an approach using lin...
One of the most prominent trends of our time is the emergence of an information society. The amount ...
Vector Space Model (GVSM) represents documents and queries as vectors in a multidimensional space. T...
International audienceKernels are widely used in Natural Language Processing as similarity measures ...
Latent Semantic Indexing (LSI) is commonly used to match queries to documents in information retriev...
Our capabilities for collecting and storing data of all kinds are greater then ever. On the other si...
In this work we present a study of different techniques for semantic indexing by dimension reduction...
Abstract. This paper presents a robust method for the construction of collection-specific document m...
The task of information retrieval is to extract relevant documents for a certain query from the coll...
The main aim of this project is to apply Salton’s vector space model (VSM), and compare other method...
Vector Space Model (VSM) has been at the core of information retrieval for the past decades. VSM con...
Latent semantic indexing (LSI) can be used in patent searching to overcome drawbacks of Boolean sear...
Abstract. This paper presents a robust method for the construction of collection-specic document mod...
The Basic Vector Space Model (BVSM) is well known in information retrieval. Unfortunately, its retri...
The method of latent semantic indexing (LSI) is well-known for tackling the synonymy and polysemy pr...
Latent semantic analysis (LSA)is based on the concept of vector space mod-els, an approach using lin...
One of the most prominent trends of our time is the emergence of an information society. The amount ...
Vector Space Model (GVSM) represents documents and queries as vectors in a multidimensional space. T...
International audienceKernels are widely used in Natural Language Processing as similarity measures ...
Latent Semantic Indexing (LSI) is commonly used to match queries to documents in information retriev...
Our capabilities for collecting and storing data of all kinds are greater then ever. On the other si...
In this work we present a study of different techniques for semantic indexing by dimension reduction...
Abstract. This paper presents a robust method for the construction of collection-specific document m...
The task of information retrieval is to extract relevant documents for a certain query from the coll...
The main aim of this project is to apply Salton’s vector space model (VSM), and compare other method...
Vector Space Model (VSM) has been at the core of information retrieval for the past decades. VSM con...
Latent semantic indexing (LSI) can be used in patent searching to overcome drawbacks of Boolean sear...
Abstract. This paper presents a robust method for the construction of collection-specic document mod...