This paper addresses the problem of clustering data when the available data measurements are not multivariate vectors of fixed dimensionality. For example, one might have data from a set of medical patients, where for each patient there are time series, image, text, and multivariate data. We propose a general probabilistic clustering framework for clustering heterogeneous data types of this form. We focus on two-level probabilistic hierarchical models, consisting of a high-level mixture model on parameters and a low-level model for observations. This general framework permits probabilistic clustering of "objects" (sequences, histograms, images, etc) using an extension of the expectation-maximization (EM) algorithm which we derive. We furthe...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
This paper addresses the problem of clustering data when the available data measurements are not mul...
In this paper a generic probabilistic framework for the unsupervised hierarchical clustering of larg...
The problem of clustering probability density functions is emerging in different scientific domains....
The problem of clustering probability density functions is emerging in different scientific domains....
We present a novel algorithm for agglomerative hierarchical clustering based on evaluating marginal ...
none3noThe problem of clustering probability density functions is emerging in different scientific d...
Biological data, such as gene expression profiles or protein sequences, is often organized in a hier...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
Many domains are naturally organized in an abstraction hierarchy or taxonomy, where the instances in...
Objective: In this paper, we focused on devel- oping a clustering approach for biological data. In m...
International audienceMost clustering and classification methods are based on the assumption that th...
International audienceMost clustering and classification methods are based on the assumption that th...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
This paper addresses the problem of clustering data when the available data measurements are not mul...
In this paper a generic probabilistic framework for the unsupervised hierarchical clustering of larg...
The problem of clustering probability density functions is emerging in different scientific domains....
The problem of clustering probability density functions is emerging in different scientific domains....
We present a novel algorithm for agglomerative hierarchical clustering based on evaluating marginal ...
none3noThe problem of clustering probability density functions is emerging in different scientific d...
Biological data, such as gene expression profiles or protein sequences, is often organized in a hier...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
Many domains are naturally organized in an abstraction hierarchy or taxonomy, where the instances in...
Objective: In this paper, we focused on devel- oping a clustering approach for biological data. In m...
International audienceMost clustering and classification methods are based on the assumption that th...
International audienceMost clustering and classification methods are based on the assumption that th...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...
34 pages, 11 figuresInternational audienceCount data is becoming more and more ubiquitous in a wide ...