How to sample training/validation data is an important question for machine learning models, especially when the dataset is heterogeneous and skewed. In this paper, we propose a data sampling method that robustly selects training/validation data. We formulate the training/validation data sampling process as a two-player game: a trainer aims to sample training data so as to minimize the test error, while a validator adversarially samples validation data that can increase the test error. Robust sampling is achieved at the game equilibrium. To accelerate the searching process, we adopt reinforcement learning aided Monte Carlo trees search (MCTS). We apply our method to a car-following modeling problem, a complicated scenario with heterogeneous...
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play....
Machine learning often relies on costly labeled data, which impedes its application to new classific...
As a promising sequential decision-making algorithm, deep reinforcement learning (RL) has been appli...
In applied statistics, tools from machine learning are popular for analyzing complex and high-dimens...
Monte-Carlo Tree Search is a best-first search technique based on simulations to sample the state sp...
Monte-Carlo Tree Search is a best-first search technique based on simulations to sample the state sp...
In this dissertation, we explore two fundamental sets of inference problems arising in machine learn...
In this dissertation, we explore two fundamental sets of inference problems arising in machine learn...
One of the main goal of Artificial Intelligence is to develop models capable of providing valuable p...
This thesis explores one of the most fundamental questions in Machine Learning, namely, how should t...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
Monte-Carlo Tree Search is a best-first search technique based on simulations to sample the state sp...
We introduce a sampling perspective to tackle the challenging task of training robust Reinforcement ...
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play....
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play....
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play....
Machine learning often relies on costly labeled data, which impedes its application to new classific...
As a promising sequential decision-making algorithm, deep reinforcement learning (RL) has been appli...
In applied statistics, tools from machine learning are popular for analyzing complex and high-dimens...
Monte-Carlo Tree Search is a best-first search technique based on simulations to sample the state sp...
Monte-Carlo Tree Search is a best-first search technique based on simulations to sample the state sp...
In this dissertation, we explore two fundamental sets of inference problems arising in machine learn...
In this dissertation, we explore two fundamental sets of inference problems arising in machine learn...
One of the main goal of Artificial Intelligence is to develop models capable of providing valuable p...
This thesis explores one of the most fundamental questions in Machine Learning, namely, how should t...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
Monte-Carlo Tree Search is a best-first search technique based on simulations to sample the state sp...
We introduce a sampling perspective to tackle the challenging task of training robust Reinforcement ...
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play....
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play....
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play....
Machine learning often relies on costly labeled data, which impedes its application to new classific...
As a promising sequential decision-making algorithm, deep reinforcement learning (RL) has been appli...