UniRank: Unimodal Bandit Algorithm for Online Ranking

Gauthier, Camille-Sovanneary
Gaudel, Romaric
Fromont, Elisa

Publication date

July 2022

Publisher

HAL CCSD

Abstract

We tackle, in the multiple-play bandit setting, the online ranking problem of assigning $L$ items to $K$ predefined positions on a web page in order to maximize the number of user clicks. We propose a generic algorithm, UniRank, that tackles state-of-the-art click models. The regret bound of this algorithm is a direct consequence of the unimodality-like property of the bandit setting with respect to a graph where nodes are ordered sets of indistinguishable items. The main contribution of UniRank is its $O\left(L/\Delta \log T\right)$ regret for $T$ consecutive assignments, where $\Delta$ relates to the reward-gap between two items. This regret bound is based on the usually implicit condition that two items may not have th...
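
To make the setting in the abstract concrete, the following is a minimal sketch of the ranking problem and of cumulative regret. It is not UniRank itself. It assumes a position-based click model, in which the click probability is the product of an item attractiveness and a position examination probability; the abstract only says "state-of-the-art click models", so this choice is an illustrative assumption. The names `theta`, `kappa`, `expected_clicks`, `best_ranking`, and `cumulative_regret`, and all numeric values, are hypothetical.

```python
from itertools import permutations


def expected_clicks(ranking, theta, kappa):
    """Expected number of clicks when item ranking[k] is shown at position k.

    Assumes a position-based click model (an illustrative assumption):
    P(click on item i at position k) = theta[i] * kappa[k].
    """
    return sum(theta[item] * kappa[k] for k, item in enumerate(ranking))


def best_ranking(theta, kappa):
    """Brute-force the best assignment of L items to K positions.

    Only practical for tiny L and K; used here as a reference point.
    """
    num_positions = len(kappa)
    candidates = permutations(range(len(theta)), num_positions)
    return max(candidates, key=lambda r: expected_clicks(r, theta, kappa))


def cumulative_regret(rankings, theta, kappa):
    """Sum over rounds of (optimal expected clicks - expected clicks obtained)."""
    optimum = expected_clicks(best_ranking(theta, kappa), theta, kappa)
    return sum(optimum - expected_clicks(r, theta, kappa) for r in rankings)


# Hypothetical instance: L = 4 items, K = 2 positions.
theta = [0.9, 0.5, 0.3, 0.1]  # item attractiveness (made-up values)
kappa = [1.0, 0.6]            # position examination probabilities (made-up values)

# T = 2 rounds: the optimal ranking, then the same two items swapped.
played = [(0, 1), (1, 0)]
regret = cumulative_regret(played, theta, kappa)
```

In this sketch the learner would not know `theta` or `kappa`; a bandit algorithm chooses each round's ranking from past click feedback only, and the regret bound quoted above describes how slowly this cumulative gap grows with $T$.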
