We study the problem of estimating the largest gain of an unknown linear time-invariant filter, also known as the H∞ norm of the system. Using ideas from the stochastic multi-armed bandit framework, we present a new algorithm that sequentially designs an input signal in order to estimate this quantity from input-output data. The algorithm is shown empirically to beat an asymptotically optimal method, known as Thompson Sampling, in the sense of its cumulative regret. Finally, for a general class of algorithms, a lower bound on the performance of estimating the H∞ norm is derived.
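The quantity estimated above, the H∞ norm, is the peak magnitude of the filter's frequency response over all frequencies. For a *known* filter this peak gain can be evaluated directly on a frequency grid; a minimal NumPy sketch (the FIR coefficients are illustrative, not taken from the paper):

```python
import numpy as np

# The H-infinity norm of a stable discrete-time LTI filter is the peak
# gain of its frequency response: sup over w in [0, pi] of |H(e^{jw})|.
# Hypothetical FIR filter used purely for illustration.
b = np.array([1.0, 0.5, 0.25])            # impulse response (FIR taps)

w = np.linspace(0.0, np.pi, 4096)          # dense frequency grid on [0, pi]
# H(e^{jw}) = sum_k b[k] * e^{-jwk}, evaluated at every grid point at once
H = b @ np.exp(-1j * np.outer(np.arange(len(b)), w))

hinf = np.abs(H).max()                     # grid approximation of the H-inf norm
# For these all-positive taps the peak is at w = 0, so hinf = 1.0 + 0.5 + 0.25
```

In the setting of the paper the filter is unknown, so this direct computation is unavailable; the bandit algorithm instead probes the system with designed inputs to locate the peak-gain frequency from noisy input-output data.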
This paper considers the problem of maximizing an expectation function over a ...
In this paper we consider the problem of actively learning the mean values of distributions associat...
Algorithms based on upper-confidence bounds for balancing exploration and expl...
A novel approach to the gain estimation problem, using a multi-armed bandit formulation, is studied. ...
We present the gain estimation problem for linear dynamical systems as a multi-armed bandit. This is...
We consider a bandit problem which involves sequential sampling from two populations (arms). Each ar...
In this paper, we consider stochastic multi-armed bandits (MABs) with heavy-tailed rewards, whose p-...
This thesis investigates a new method to estimate the system norm using reinforcement learning. Give...
The multi-armed bandit problem is a popular model for studying exploration/exploitation trade-off in...
We consider a linear stochastic bandit problem where the dimension K of the unknown parameter is l...
We consider a stochastic bandit problem with a possibly infinite number of arms. We write p∗ for the...
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
We improve the theoretical analysis and empirical performance of algorithms for the stochastic multi...