International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the unit hypersphere. This is particularly adapted for high-dimensional directional data such as texts. We propose in this article to estimate a von Mises mixture using a l1 penalized likelihood. This leads to sparse prototypes that improve both clustering quality and interpretability. We introduce an expectation-maximisation (EM) algorithm for this estimation and show the advantages of the approach on real data benchmark. We propose to explore the trade-off between the sparsity term and the likelihood one with a simple path following algorithm
We study estimation of mixture models for problems in which multiple views of the instances are avai...
International audienceData clustering has received a lot of attention and numerous methods, algorith...
Machine learning applications often involve data that can be analyzed as unit vectors on a d-dimensi...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
Several large scale data mining applications, such as text categorization and gene expression analys...
In this paper we consider both clustering and graphical modeling for given data. The clustering is t...
Address email Clustering is often formulated as the maximum likelihood estimation of a mixture model...
International audienceThis paper studies a new expectation maximization (EM) algorithm to estimate t...
Mixtures of von Mises-Fisher distributions have been shown to be an effective model for clustering d...
Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to d...
International audienceVariable selection is fundamental to high-dimensional statistical modeling, an...
This thesis deals with the problem of modeling and estimation of high-dimensional MoE models, toward...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
The Expectation–Maximization (EM) algorithm is a popular tool in a wide variety of statistical setti...
We study estimation of mixture models for problems in which multiple views of the instances are avai...
International audienceData clustering has received a lot of attention and numerous methods, algorith...
Machine learning applications often involve data that can be analyzed as unit vectors on a d-dimensi...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
Several large scale data mining applications, such as text categorization and gene expression analys...
In this paper we consider both clustering and graphical modeling for given data. The clustering is t...
Address email Clustering is often formulated as the maximum likelihood estimation of a mixture model...
International audienceThis paper studies a new expectation maximization (EM) algorithm to estimate t...
Mixtures of von Mises-Fisher distributions have been shown to be an effective model for clustering d...
Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to d...
International audienceVariable selection is fundamental to high-dimensional statistical modeling, an...
This thesis deals with the problem of modeling and estimation of high-dimensional MoE models, toward...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
The Expectation–Maximization (EM) algorithm is a popular tool in a wide variety of statistical setti...
We study estimation of mixture models for problems in which multiple views of the instances are avai...
International audienceData clustering has received a lot of attention and numerous methods, algorith...
Machine learning applications often involve data that can be analyzed as unit vectors on a d-dimensi...