An important information access problem arises when the user is confronted with a very large number of documents that have been retrieved in response to a query. In this paper we explore the use of a technique, called Scatter/Gather, for the navigation of large col-lections of retrieved documents. Scatter/Gather clus-ters the documents into semantically coherent groups on-the-fly and presents descriptive summaries of the groups to the user. These groups can be used in sev-eral ways: to identify useful subsets of documents to be perused with other tools, to eliminate sub-sets whose contents appear nonrelevant, or to se-lect promising document subsets for reclustering into more refined groups. This paper describes the Scat-ter/Gather algorith...
The amount of music files on the Internet keeps on growing, and there is a need for easier navigatio...
Information retrieval can be likened to a mining process. Searchers drill through a document space ...
This paper presents a novel approach for search engine results clustering that relies on the semanti...
The Scatter/Gather document browsing method uses fast document clustering to produce table-of-conten...
The Scatter/Gather document browsing method uses fast document clustering to produce table-of-conten...
As highly structured documents with rich metadata (such as products, movies, etc.) become increasing...
Scatter storage schemes are examined with respect to their applicability to dictionary lookup proced...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
This thesis presents new methods for classification and thematic grouping of billions of web pages, ...
We present a new visualization approach for metadata combining different visualizations into a so-ca...
The huge volume of text documents available on the internet has made it difficult to find valuable i...
Successful knowledge management requires efficient tools to manage information in the form of text. ...
textLatent variable models such as Latent Dirichlet Allocation provide rich tools for analyzing larg...
This paper presents different methods tested by the University of Avignon and Bertin at the TREC-7 e...
The amount of music files on the Internet keeps on growing, and there is a need for easier navigatio...
Information retrieval can be likened to a mining process. Searchers drill through a document space ...
This paper presents a novel approach for search engine results clustering that relies on the semanti...
The Scatter/Gather document browsing method uses fast document clustering to produce table-of-conten...
The Scatter/Gather document browsing method uses fast document clustering to produce table-of-conten...
As highly structured documents with rich metadata (such as products, movies, etc.) become increasing...
Scatter storage schemes are examined with respect to their applicability to dictionary lookup proced...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
This thesis presents new methods for classification and thematic grouping of billions of web pages, ...
We present a new visualization approach for metadata combining different visualizations into a so-ca...
The huge volume of text documents available on the internet has made it difficult to find valuable i...
Successful knowledge management requires efficient tools to manage information in the form of text. ...
textLatent variable models such as Latent Dirichlet Allocation provide rich tools for analyzing larg...
This paper presents different methods tested by the University of Avignon and Bertin at the TREC-7 e...
The amount of music files on the Internet keeps on growing, and there is a need for easier navigatio...
Information retrieval can be likened to a mining process. Searchers drill through a document space ...
This paper presents a novel approach for search engine results clustering that relies on the semanti...