The Latent Semantic Indexing (LSI) approach offers a promising way to overcome the language barrier between queries and documents, but the high dimensionality of the training matrix makes its key step, Singular Value Decomposition (SVD), computationally prohibitive. Based on the semantic parallelism of the multilingual training corpus, we prove in this paper that, in theory, if the training term-by-document matrix takes either of two symmetry forms, strong or weak, the matrix to be decomposed can be reduced to the size of a monolingual matrix. Retrieval accuracy does not deteriorate under this simplification. We also discuss what these two forms of symmetry mean in the context of multi...
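The abstract above does not define the strong and weak symmetry forms, so the following is only a minimal sketch under a deliberately idealized assumption: the two language blocks of the stacked training matrix are identical (`stacked = [A; A]`). That assumption, the random matrix `A`, and the helper names `lsi_doc_coords` and `cosine_matrix` are illustrative and are not taken from the paper. The sketch compares the LSI document space obtained from the stacked bilingual matrix with the one obtained from the monolingual-sized block alone.

```python
import numpy as np

def lsi_doc_coords(matrix, k):
    """Return all singular values and the k-dimensional document coordinates."""
    # Thin SVD: matrix = U @ diag(s) @ Vt
    _, s, vt = np.linalg.svd(matrix, full_matrices=False)
    # One row per document: top-k right singular vector entries scaled by s
    return s, (s[:k, None] * vt[:k]).T

def cosine_matrix(x):
    """Pairwise cosine similarities between the rows of x."""
    x = x / np.linalg.norm(x, axis=1, keepdims=True)
    return x @ x.T

rng = np.random.default_rng(0)
A = rng.random((6, 4))        # hypothetical monolingual block: 6 terms, 4 documents
stacked = np.vstack([A, A])   # assumed idealized symmetry: both language blocks equal

s_mono, docs_mono = lsi_doc_coords(A, k=2)
s_bi, docs_bi = lsi_doc_coords(stacked, k=2)

# Document-to-document similarities from the small and the stacked matrix
same_similarities = np.allclose(cosine_matrix(docs_mono), cosine_matrix(docs_bi))
# Singular values of the stacked matrix versus those of the single block
scaled_by_sqrt2 = np.allclose(s_bi, np.sqrt(2) * s_mono)
print(same_similarities, scaled_by_sqrt2)
```

Under this assumption the Gram matrix of the stacked matrix is exactly twice that of `A`, so the right singular vectors coincide and the singular values differ only by a factor of the square root of 2. Decomposing the smaller matrix therefore yields the same document similarities in this toy case; the paper's actual symmetry conditions may be more general.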
We describe a method for fully automated cross-language document retrieval in which no query transla...
Text retrieval using Latent Semantic Indexing (LSI) with truncated Singular Value Decomposition (SVD...
Lexical-matching methods for information retrieval can be inaccurate when they are used to match a u...
Latent Semantic Indexing (LSI) is a retrieval technique that employs Singular Value Deco...
In this paper, we report the use of a large-scale bilingual corpus in Cross-Language Latent...
LSI is usually conducted using the singular value decomposition (SVD). The main difficul...
Latent semantic indexing (LSI) is a method of information retrieval that relies heavily on the parti...
Cross-lingual information retrieval is a difficult task typically involving query translation into m...
Latent Semantic Indexing (LSI) has been successfully applied to information retrieval and classifica...
With the electronic storage of documents comes the possibility of building search engines that can ...
When people search for documents, they eventually want content, not words. Hence, search engines sho...
Latent Semantic Indexing (LSI) is commonly used to match queries to documents in information retriev...
Our capabilities for collecting and storing data of all kinds are greater than ever. On the other si...
In this paper we present a theoretical model for understanding the performance of Latent Semantic In...