In recent years several well-known approaches to visualize the topical structure of a document collection have been proposed. Most of them feature spectral analysis of a term-document matrix with influence values and dimensionality reduction. We generalize this approach by arguing that there are many reasonable ways to project the term-document matrix into low-dimensional space in which different features of the corpus are emphasized. Our main tool is a continuous generalization of adjacency-respecting partitions called structural similarity. In this way we obtain a generic framework in which influence weights in the term-document matrix, dimensionality-reducing projections, and the display of a target subspace may be varied according to na...
Conceptual space can be carved up linguistically in different ways. The mapping between a set of rel...
This paper presents a new spectral clustering method called correlation preserving indexing (CPI), w...
Despite many technological advances, the information overload problem still prevails in many applica...
In my thesis I am presenting an approach of conceptual spaces for vizulalization of text corpora. Th...
This work presents a system for analysis and visualization of document collections based on lexical ...
Word Space Models (WSMs) are a statistical-computational technique to compare the collocational beha...
Abstract. The current availability of information many times impair the tasks of searching, browsing...
Conceptual space can be carved up linguistically in different ways. The mapping between a set of rel...
Latent semantic analysis (LSA) is a technique that analyzes relationships between documents and its ...
This study proposes and evaluates a document analysis strategy for information retrieval with visua...
Latent Semantic Analysis (LSA) is a technique that analyzes relationships between documents and its ...
Large amounts of data are only available in textual form. However, due to the semi-structured nature...
With the abundance of written information available online, it is useful to be able to automatically...
Document representation is important for computer-based text processing. Good document representatio...
AbstractParse thickets provide extensive descriptive representation of text information, based on st...
Conceptual space can be carved up linguistically in different ways. The mapping between a set of rel...
This paper presents a new spectral clustering method called correlation preserving indexing (CPI), w...
Despite many technological advances, the information overload problem still prevails in many applica...
In my thesis I am presenting an approach of conceptual spaces for vizulalization of text corpora. Th...
This work presents a system for analysis and visualization of document collections based on lexical ...
Word Space Models (WSMs) are a statistical-computational technique to compare the collocational beha...
Abstract. The current availability of information many times impair the tasks of searching, browsing...
Conceptual space can be carved up linguistically in different ways. The mapping between a set of rel...
Latent semantic analysis (LSA) is a technique that analyzes relationships between documents and its ...
This study proposes and evaluates a document analysis strategy for information retrieval with visua...
Latent Semantic Analysis (LSA) is a technique that analyzes relationships between documents and its ...
Large amounts of data are only available in textual form. However, due to the semi-structured nature...
With the abundance of written information available online, it is useful to be able to automatically...
Document representation is important for computer-based text processing. Good document representatio...
AbstractParse thickets provide extensive descriptive representation of text information, based on st...
Conceptual space can be carved up linguistically in different ways. The mapping between a set of rel...
This paper presents a new spectral clustering method called correlation preserving indexing (CPI), w...
Despite many technological advances, the information overload problem still prevails in many applica...