This paper presents a dissimilarity-based discriminative framework for learning from data coming in the form of probability distributions. Departing from the use of positive kernel-based methods, we build upon embeddings based on dissimilarities tailored for distribution. We enable this by extending \citet{balcan2008theory}'s theory of learning with similarity functions to the case of distribution-shaped data. Then, we show that several learning guarantees of the dissimilarity still hold when estimated from empirical distributions. Algorithmically, the proposed approach consists in building features from pairwise dissimilarities and in learning a linear decision function in this new feature space. Our experimenta...
The dissimilarity representation is an alternative for the use of features in the recognition of rea...
Many problems in unsupervised learning require the analysis of features of probability distributions...
Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest ...
This paper presents a dissimilarity-based discriminative framework for learning from data coming in...
A pre-printMeasures of similarity (or dissimilarity) are a key ingredient to many machine learning a...
We study the problem of classification when only a dissimilarity function between objects is accessi...
We study the problem of classification when only a dissimilarity function between objects is accessi...
This work explores statistical properties of machine learning algorithms from different perspectives...
We study the problem of learning a classification task in which only a dissimilarity function of the...
Any binary classifier (or score-function) can be used to define a dissimilarity between two distrib...
Existing distance metric learning methods require optimisation to learn a feature space to transform...
International audienceStatistical pattern recognition traditionally relies on features-based represe...
National audienceStatistical pattern recognition traditionally relies on a features based representa...
International audienceCorrectly estimating the discrepancy between two data distributions has always...
Kernel embeddings of distributions and the Maximum Mean Discrepancy (MMD), the resulting distance be...
The dissimilarity representation is an alternative for the use of features in the recognition of rea...
Many problems in unsupervised learning require the analysis of features of probability distributions...
Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest ...
This paper presents a dissimilarity-based discriminative framework for learning from data coming in...
A pre-printMeasures of similarity (or dissimilarity) are a key ingredient to many machine learning a...
We study the problem of classification when only a dissimilarity function between objects is accessi...
We study the problem of classification when only a dissimilarity function between objects is accessi...
This work explores statistical properties of machine learning algorithms from different perspectives...
We study the problem of learning a classification task in which only a dissimilarity function of the...
Any binary classifier (or score-function) can be used to define a dissimilarity between two distrib...
Existing distance metric learning methods require optimisation to learn a feature space to transform...
International audienceStatistical pattern recognition traditionally relies on features-based represe...
National audienceStatistical pattern recognition traditionally relies on a features based representa...
International audienceCorrectly estimating the discrepancy between two data distributions has always...
Kernel embeddings of distributions and the Maximum Mean Discrepancy (MMD), the resulting distance be...
The dissimilarity representation is an alternative for the use of features in the recognition of rea...
Many problems in unsupervised learning require the analysis of features of probability distributions...
Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest ...