This thesis deals with variable selection for clustering. This problem has become all the more challenging since the recent increase in high-dimensional data where the number of variables can largely exceeds the number of observations (DNA analysis, functional data clustering...). We propose a variable selection procedure for clustering suited to high-dimensional contexts. We consider clustering based on finite Gaussian mixture models in order to recast both the variable selection and the choice of the number of clusters into a global model selection problem. We use the variable selection property of l1-regularization to build a data-driven model collection in a efficient way. Our procedure differs from classical procedures using l1-regular...
This thesis proposes three original contributions for the clustering of particular types of data: mu...
More and more scientific studies yield to the collection of a large amount of data that consist of s...
International audienceWe compare two major approaches to variable selection in clustering: model sel...
Il existe des situations de modélisation statistique pour lesquelles le problème classique de classi...
This PhD thesis deals with the following statistical problems: Variable selection in high-Dimensiona...
Au vu de l'augmentation du nombre de jeux de données de grande dimension, la sélection de variables ...
This thesis deals with the problem of modeling and estimation of high-dimensional MoE models, toward...
We are interested in variable selection for clustering with Gaussian mixture models. This research i...
This thesis deals with the problem of modeling and estimation of high-dimensional MoE models, toward...
This habilitation thesis retraces works focusing mainly on model based clustering and the related is...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
The reported works take place in the statistical framework of model-based clustering. We particularl...
Finite mixture regression models are useful for modeling the relationship between a response andpred...
The thesis tackles the problem of uncovering hidden structures in high-dimensional data in the prese...
This thesis proposes three original contributions for the clustering of particular types of data: mu...
More and more scientific studies yield to the collection of a large amount of data that consist of s...
International audienceWe compare two major approaches to variable selection in clustering: model sel...
Il existe des situations de modélisation statistique pour lesquelles le problème classique de classi...
This PhD thesis deals with the following statistical problems: Variable selection in high-Dimensiona...
Au vu de l'augmentation du nombre de jeux de données de grande dimension, la sélection de variables ...
This thesis deals with the problem of modeling and estimation of high-dimensional MoE models, toward...
We are interested in variable selection for clustering with Gaussian mixture models. This research i...
This thesis deals with the problem of modeling and estimation of high-dimensional MoE models, toward...
This habilitation thesis retraces works focusing mainly on model based clustering and the related is...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
The reported works take place in the statistical framework of model-based clustering. We particularl...
Finite mixture regression models are useful for modeling the relationship between a response andpred...
The thesis tackles the problem of uncovering hidden structures in high-dimensional data in the prese...
This thesis proposes three original contributions for the clustering of particular types of data: mu...
More and more scientific studies yield to the collection of a large amount of data that consist of s...
International audienceWe compare two major approaches to variable selection in clustering: model sel...