The goal of machine learning is to develop predictors that generalize well to test data. Ideally, this is achieved by training on very large (infinite) training data sets that cap-ture all variations in the data distribution. In the case of finite training data, an effec-tive solution is to extend the training set with artificially created examples—which, how-ever, is also computationally costly. We pro-pose to corrupt training examples with noise from known distributions within the expo-nential family and present a novel learning algorithm, called marginalized corrupted fea-tures (MCF), that trains robust predictors by minimizing the expected value of the loss function under the corrupting distribution— essentially learning with infinitely...
Despite their impressive performance on large-scale benchmarks, machine learning sys- tems turn out ...
We describe and analyze efficient algorithms for learning a linear predictor from examples when the ...
Modern technological advances have prompted massive scale data collection in manymodern fields such ...
The goal of machine learning is to develop predictors that generalize well to test data. Ideally, th...
Abstract A common assumption in supervised machine learning is that the training exam-ples provided ...
Denoising auto-encoders (DAEs) have been suc-cessfully used to learn new representations for a wide ...
Denoising auto-encoders (DAEs) have been suc-cessfully used to learn new representations for a wide ...
We consider a general statistical learning problem where an unknown fraction of the training data is...
Machine learning algorithms are invented to learn from data and to use data to perform predictions a...
Robustness of machine learning, often referring to securing performance on different data, is always...
We study the performance -- and specifically the rate at which the error probability converges to ze...
Addressing fairness concerns about machine learning models is a crucial step towards their long-term...
Addressing fairness concerns about machine learning models is a crucial step towards their long-term...
Regularization techniques have become a principled tool for model-based statistics and artificial in...
Addressing fairness concerns about machine learning models is a crucial step towards their long-term...
Despite their impressive performance on large-scale benchmarks, machine learning sys- tems turn out ...
We describe and analyze efficient algorithms for learning a linear predictor from examples when the ...
Modern technological advances have prompted massive scale data collection in manymodern fields such ...
The goal of machine learning is to develop predictors that generalize well to test data. Ideally, th...
Abstract A common assumption in supervised machine learning is that the training exam-ples provided ...
Denoising auto-encoders (DAEs) have been suc-cessfully used to learn new representations for a wide ...
Denoising auto-encoders (DAEs) have been suc-cessfully used to learn new representations for a wide ...
We consider a general statistical learning problem where an unknown fraction of the training data is...
Machine learning algorithms are invented to learn from data and to use data to perform predictions a...
Robustness of machine learning, often referring to securing performance on different data, is always...
We study the performance -- and specifically the rate at which the error probability converges to ze...
Addressing fairness concerns about machine learning models is a crucial step towards their long-term...
Addressing fairness concerns about machine learning models is a crucial step towards their long-term...
Regularization techniques have become a principled tool for model-based statistics and artificial in...
Addressing fairness concerns about machine learning models is a crucial step towards their long-term...
Despite their impressive performance on large-scale benchmarks, machine learning sys- tems turn out ...
We describe and analyze efficient algorithms for learning a linear predictor from examples when the ...
Modern technological advances have prompted massive scale data collection in manymodern fields such ...