Robust Data Sampling in Machine Learning: A Game-Theoretic Framework for Training and Validation Data Selection

Zhaobin Mo
Xuan Di
Rongye Shi

Open link

Publication date

January 2023

DOI

10.3390/g14010013

Publisher

MDPI AG

Journal

Games

Abstract

How to sample training/validation data is an important question for machine learning models, especially when the dataset is heterogeneous and skewed. In this paper, we propose a data sampling method that robustly selects training/validation data. We formulate the training/validation data sampling process as a two-player game: a trainer aims to sample training data so as to minimize the test error, while a validator adversarially samples validation data that can increase the test error. Robust sampling is achieved at the game equilibrium. To accelerate the searching process, we adopt reinforcement learning aided Monte Carlo trees search (MCTS). We apply our method to a car-following modeling problem, a complicated scenario with heterogeneous...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Robust Data Sampling in Machine Learning: A Game-Theoretic Framework for Training and Validation Data Selection

Abstract

Extracted data

Robust Data Sampling in Machine Learning: A Game-Theoretic Framework for Training and Validation Data Selection

Abstract

Extracted data

Related items

Related items