We propose a framework MIC (Multiple Inclusion Criterion) for learning sparse models based on the information theoretic Minimum Description Length (MDL) principle. MIC provides an elegant way of incorporating arbitrary sparsity patterns in the feature space by using two-part MDL coding schemes. We present MIC based models for the problems of grouped feature selection (MIC-GROUP) and multi-task feature selection (MIC-MULTI). MIC-GROUP assumes that the features are divided into groups and induces two level sparsity, selecting a subset of the feature groups, and also selecting features within each selected group. MIC-MULTI applies when there are multiple related tasks that share the same set of potentially predictive features. It also induces ...
Subspace segmentation is the process of clustering a set of data points that are assumed to lie on t...
The power of sparse signal coding with learned dictionaries has been demonstrated in a variety of ap...
abstract: Advances in data collection technologies have made it cost-effective to obtain heterogeneo...
We propose a framework MIC (Multiple Inclusion Criterion) for learning sparse models based on the in...
Multi-task learning seeks to improve the generalization performance by sharing common information am...
Model sparsification in deep learning promotes simpler, more interpretable models with fewer paramet...
Given the "right" representation, learning is easy. This thesis studies representation learning and ...
In statistical machine learning, kernel methods allow to consider infinite dimensional feature space...
International audienceIn supervised classification, data representation is usually considered at the...
Today we are living in a world awash with data. Large volumes of data are acquired, analyzed and app...
When considering a data set it is often unknown how complex it is, and hence it is difficult to asse...
This paper investigates, from information theoretic grounds, a learning problem based on the princip...
Irrelevant and redundant features may reduce both predictive accuracy and comprehensibility of induc...
This is an electronic version of the paper presented at the 27 Annual Conference on Neural Informati...
We present three related ways of using Transfer Learning to improve feature selection. The three met...
Subspace segmentation is the process of clustering a set of data points that are assumed to lie on t...
The power of sparse signal coding with learned dictionaries has been demonstrated in a variety of ap...
abstract: Advances in data collection technologies have made it cost-effective to obtain heterogeneo...
We propose a framework MIC (Multiple Inclusion Criterion) for learning sparse models based on the in...
Multi-task learning seeks to improve the generalization performance by sharing common information am...
Model sparsification in deep learning promotes simpler, more interpretable models with fewer paramet...
Given the "right" representation, learning is easy. This thesis studies representation learning and ...
In statistical machine learning, kernel methods allow to consider infinite dimensional feature space...
International audienceIn supervised classification, data representation is usually considered at the...
Today we are living in a world awash with data. Large volumes of data are acquired, analyzed and app...
When considering a data set it is often unknown how complex it is, and hence it is difficult to asse...
This paper investigates, from information theoretic grounds, a learning problem based on the princip...
Irrelevant and redundant features may reduce both predictive accuracy and comprehensibility of induc...
This is an electronic version of the paper presented at the 27 Annual Conference on Neural Informati...
We present three related ways of using Transfer Learning to improve feature selection. The three met...
Subspace segmentation is the process of clustering a set of data points that are assumed to lie on t...
The power of sparse signal coding with learned dictionaries has been demonstrated in a variety of ap...
abstract: Advances in data collection technologies have made it cost-effective to obtain heterogeneo...