This thesis describes a probabilistic model for optimum information retrieval in a distributed heterogeneous environment. The model assumes the collection of documents offered by the environment to be hierarchically partitioned into subcollections. Documents as well as subcollections have to be indexed. At this indexing methods using different indexing vocabularies can be employed. A query provided by a user is answered in terms of a ranked list of documents. The model determines a procedure for ranking the documents that stems from the Probability Ranking Principle: For each subcollection the subcollection´s elements are ranked; the resulting ranked lists are combined into a final ranked list of documents where the ordering is determined b...
Ranking is an important task for handling a large amount of content. Ideally, training data for supe...
This paper presents a probabilistic information retrieval framework in which the retrieval problem i...
This paper presents a new probabilistic model of information retrieval. The most important modeling ...
This thesis describes a probabilistic model for optimum information retrieval in a distributed heter...
This paper describes a model for optimum information retrieval over a distributed document collectio...
This thesis describes a probabilistic model for optimum information retrieval in a distributed heter...
In this paper a new model and architecture for information retrieval in a widely distributed heterog...
This poster session examines a probabilistic approach to distributed information retrieval using a L...
Text clustering is an established technique for improving quality in information retrieval, for both...
In this paper a generic probabilistic framework for the unsupervised hierarchical clustering of larg...
Abstract. This paper examines technology developed to support largescale distributed digital librari...
This short paper presents some preliminary effectiveness re-sults for a probabilistic approach to di...
Abstract. Learning preference models from human generated data is an important task in mod-ern infor...
This paper presents a probabilistic information retrieval framework in which the retrieval problem i...
This paper presents a probabilistic information retrieval framework in which the retrieval problem i...
Ranking is an important task for handling a large amount of content. Ideally, training data for supe...
This paper presents a probabilistic information retrieval framework in which the retrieval problem i...
This paper presents a new probabilistic model of information retrieval. The most important modeling ...
This thesis describes a probabilistic model for optimum information retrieval in a distributed heter...
This paper describes a model for optimum information retrieval over a distributed document collectio...
This thesis describes a probabilistic model for optimum information retrieval in a distributed heter...
In this paper a new model and architecture for information retrieval in a widely distributed heterog...
This poster session examines a probabilistic approach to distributed information retrieval using a L...
Text clustering is an established technique for improving quality in information retrieval, for both...
In this paper a generic probabilistic framework for the unsupervised hierarchical clustering of larg...
Abstract. This paper examines technology developed to support largescale distributed digital librari...
This short paper presents some preliminary effectiveness re-sults for a probabilistic approach to di...
Abstract. Learning preference models from human generated data is an important task in mod-ern infor...
This paper presents a probabilistic information retrieval framework in which the retrieval problem i...
This paper presents a probabilistic information retrieval framework in which the retrieval problem i...
Ranking is an important task for handling a large amount of content. Ideally, training data for supe...
This paper presents a probabilistic information retrieval framework in which the retrieval problem i...
This paper presents a new probabilistic model of information retrieval. The most important modeling ...