We consider the general problem of Multiple Model Learning (MML) from data, from the statistical and algorithmic perspectives; this problem includes clustering, multiple regression and subspace clustering as special cases. A common approach to solving new MML problems is to generalize Lloyd’s algorithm for clustering (or Expectation-Maximization for soft clustering). However this approach is un-fortunately sensitive to outliers and large noise: a single exceptional point may take over one of the models. We propose a different general formulation that seeks for each model a distribu-tion over data points; the weights are regularized to be sufficiently spread out. This enhances robustness by making assumptions on class balance. We further pro...
We address the problem of recovering multiple structures of different classes in a dataset contamina...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
The existing Multi-View Learning (MVL) is to discuss how to learn from patterns with multiple inform...
We consider the general problem of Multiple Model Learning (MML) from data, from the statistical and...
As the ability to collect and store data improving, real data are usually made up of different forms...
Submitted to the School of Electronic and Computer Engineering in partial fulfillment of the require...
The thesis tackles the problem of uncovering hidden structures in high-dimensional data in the prese...
International audienceThis paper proposes a simple approach to derive efficient error bounds for lea...
The problem of variable clustering is that of estimating groups of similar components of a p-dimensi...
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints betw...
We consider multi-task learning in the setting of multiple linear regression, and where some relevan...
Regularization techniques have become a principled tool for model-based statistics and artificial in...
Multi-clustering, which tries to find multiple independent ways to partition a data set into groups,...
Exploiting the information from multiple views can improve clustering accuracy. However, most existi...
In multivariate datasets, multiple clustering solutions can be obtained, based on different subsets ...
We address the problem of recovering multiple structures of different classes in a dataset contamina...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
The existing Multi-View Learning (MVL) is to discuss how to learn from patterns with multiple inform...
We consider the general problem of Multiple Model Learning (MML) from data, from the statistical and...
As the ability to collect and store data improving, real data are usually made up of different forms...
Submitted to the School of Electronic and Computer Engineering in partial fulfillment of the require...
The thesis tackles the problem of uncovering hidden structures in high-dimensional data in the prese...
International audienceThis paper proposes a simple approach to derive efficient error bounds for lea...
The problem of variable clustering is that of estimating groups of similar components of a p-dimensi...
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints betw...
We consider multi-task learning in the setting of multiple linear regression, and where some relevan...
Regularization techniques have become a principled tool for model-based statistics and artificial in...
Multi-clustering, which tries to find multiple independent ways to partition a data set into groups,...
Exploiting the information from multiple views can improve clustering accuracy. However, most existi...
In multivariate datasets, multiple clustering solutions can be obtained, based on different subsets ...
We address the problem of recovering multiple structures of different classes in a dataset contamina...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
The existing Multi-View Learning (MVL) is to discuss how to learn from patterns with multiple inform...