Abstract—Supervised learning is a classic data mining problem where one wishes to be able to predict an output value associated with a particular input vector. We present a new twist on this classic problem where, instead of having the training set contain an individual output value for each input vector, the output values in the training set are only given in aggregate over a number of input vectors. This new problem arose from a particular need in learning on mass spectrometry data, but could easily apply to situations when data has been aggregated in order to maintain privacy. We provide a formal description of this new problem for both classification and regression. We then examine how k-nearest neighbor, neural networks, support vector...
Machine learning algorithms are used to train the machine to learn on its own and improve from exper...
Abstract Supervised learning accounts for a lot of research activity in machine learning and many su...
The volume of data generated and collected using modern technologies grows exponentially. This vast ...
Regression uses supervised machine learning to find a model that combines several independent variab...
The solution of classification problems using statistical techniques requires appropriately labelled...
Due to the privacy protection or the difficulty of data collection, we cannot observe individual out...
This paper presents an assembling unsupervised learning framework that adopts the information coming...
© 2016 Elsevier Inc. This paper introduces a novel algorithm, called Supervised Aggregated FEature l...
We consider the problem of learning a neural network classifier. Under the information bottleneck (I...
In this paper, we introduce a new learning strategy based on a seminal idea of Mojirsheibani (1999, ...
Nowadays, many machine learning procedures are available on the shelve and may be used easily to cal...
In this paper a novel data mining algorithm, Clustering and Classification Algorithm-Supervised (CCA...
Various real-world applications involve directly dealing with aggregate data. In this work, we study...
Aggregating a dataset, then injecting some noise, is a simple and common way to release differential...
Census data provide detailed information about population characteristics at a coarse resolution. Ne...
Machine learning algorithms are used to train the machine to learn on its own and improve from exper...
Abstract Supervised learning accounts for a lot of research activity in machine learning and many su...
The volume of data generated and collected using modern technologies grows exponentially. This vast ...
Regression uses supervised machine learning to find a model that combines several independent variab...
The solution of classification problems using statistical techniques requires appropriately labelled...
Due to the privacy protection or the difficulty of data collection, we cannot observe individual out...
This paper presents an assembling unsupervised learning framework that adopts the information coming...
© 2016 Elsevier Inc. This paper introduces a novel algorithm, called Supervised Aggregated FEature l...
We consider the problem of learning a neural network classifier. Under the information bottleneck (I...
In this paper, we introduce a new learning strategy based on a seminal idea of Mojirsheibani (1999, ...
Nowadays, many machine learning procedures are available on the shelve and may be used easily to cal...
In this paper a novel data mining algorithm, Clustering and Classification Algorithm-Supervised (CCA...
Various real-world applications involve directly dealing with aggregate data. In this work, we study...
Aggregating a dataset, then injecting some noise, is a simple and common way to release differential...
Census data provide detailed information about population characteristics at a coarse resolution. Ne...
Machine learning algorithms are used to train the machine to learn on its own and improve from exper...
Abstract Supervised learning accounts for a lot of research activity in machine learning and many su...
The volume of data generated and collected using modern technologies grows exponentially. This vast ...