A novel approach to the gain estimation problem,using a multi-armed bandit formulation, is studied. The gain estimation problem deals with the problem of estimating the largest L2-gain that signal of bounded norm experiences when passing through a linear and time-invariant system. Under certain conditions, this new approach is guaranteed to surpass traditional System Identification methods in terms of accuracy.The bandit algorithms Upper Confidence Bound, Thompson Sampling and Weighted Thompson Sampling are implemented with the aim of designing the optimal input for maximizing the gain of an unknown system. The regret performance of each algorithm is studied using simulations on a test system. Upper Confidence Bound, with exploration parame...
We analyze and compare methods to estimate the 2-gain ( ∞-norm) of a stable linear dynamical syste...
International audienceAlgorithms based on upper confidence bounds for balancing exploration and expl...
Computer experiments are widely used to mimic expensive physical processes as black-box functions. A...
A novel approach to the gain estimation problem,using a multi-armed bandit formulation, is studied. ...
We present the gain estimation problem for linear dynamical systems as a multi-armed bandit. This is...
We study the problem of estimating the largest gain of an unknown linear and time-invariant filter, ...
Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We fo...
Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We fo...
In this paper we present and discuss some data-driven methods for estimation of the L2-gain of dynam...
This paper considers the use of a simple posterior sampling algorithm to balance between exploration...
Modifying the reward-biased maximum likelihood method originally proposed in the adaptive control li...
Abstract. In this paper, we study the problem of estimating the mean values of all the arms uniforml...
Many applications require optimizing an un-known, noisy function that is expensive to evaluate. We f...
In this paper, we consider stochastic multi-armed bandits (MABs) with heavy-tailed rewards, whose p-...
International audienceAlgorithms based on upper-confidence bounds for balancing exploration and expl...
We analyze and compare methods to estimate the 2-gain ( ∞-norm) of a stable linear dynamical syste...
International audienceAlgorithms based on upper confidence bounds for balancing exploration and expl...
Computer experiments are widely used to mimic expensive physical processes as black-box functions. A...
A novel approach to the gain estimation problem,using a multi-armed bandit formulation, is studied. ...
We present the gain estimation problem for linear dynamical systems as a multi-armed bandit. This is...
We study the problem of estimating the largest gain of an unknown linear and time-invariant filter, ...
Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We fo...
Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We fo...
In this paper we present and discuss some data-driven methods for estimation of the L2-gain of dynam...
This paper considers the use of a simple posterior sampling algorithm to balance between exploration...
Modifying the reward-biased maximum likelihood method originally proposed in the adaptive control li...
Abstract. In this paper, we study the problem of estimating the mean values of all the arms uniforml...
Many applications require optimizing an un-known, noisy function that is expensive to evaluate. We f...
In this paper, we consider stochastic multi-armed bandits (MABs) with heavy-tailed rewards, whose p-...
International audienceAlgorithms based on upper-confidence bounds for balancing exploration and expl...
We analyze and compare methods to estimate the 2-gain ( ∞-norm) of a stable linear dynamical syste...
International audienceAlgorithms based on upper confidence bounds for balancing exploration and expl...
Computer experiments are widely used to mimic expensive physical processes as black-box functions. A...