We study the problem of identifying user clusters in contextual multi-armed bandits (MAB). Contextual MAB is an effective tool for many real applications, such as content recommendation and online advertisement. In practice, user dependency plays an essential role in a user's actions, and thus in the observed rewards. Clustering similar users can improve the quality of reward estimation, which in turn leads to more effective content recommendation and targeted advertising. Unlike traditional clustering settings, we cluster users based on the unknown bandit parameters, which are estimated incrementally. In particular, we define the problem of cluster detection in contextual MAB, and propose a bandit algorithm, LOCB, embedded with local clustering procedu...
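As a rough illustration of clustering users by their estimated bandit parameters, the sketch below keeps a LinUCB-style ridge-regression estimate per user and greedily groups users whose estimates fall within each other's confidence radii. The class names, the radius heuristic, and the grouping rule are illustrative assumptions, not the paper's actual LOCB procedure.

import numpy as np

class UserBanditEstimate:
    # Ridge-regression estimate of one user's unknown bandit parameter (LinUCB-style).
    def __init__(self, dim, lam=1.0):
        self.A = lam * np.eye(dim)   # regularized Gram matrix of observed contexts
        self.b = np.zeros(dim)       # accumulated reward-weighted contexts
        self.t = 0                   # number of observations for this user

    def update(self, context, reward):
        self.A += np.outer(context, context)
        self.b += reward * context
        self.t += 1

    @property
    def theta_hat(self):
        return np.linalg.solve(self.A, self.b)

    def radius(self, alpha=1.0):
        # Shrinking confidence radius; a simple heuristic, not a formal bound.
        return alpha / np.sqrt(max(self.t, 1))

def detect_clusters(estimates, alpha=1.0):
    # Greedy grouping around unassigned seed users: v joins u's cluster when their
    # parameter estimates are closer than the sum of their confidence radii.
    users, clusters, assigned = list(estimates), [], set()
    for u in users:
        if u in assigned:
            continue
        cluster = {u}
        for v in users:
            if v in assigned or v == u:
                continue
            gap = np.linalg.norm(estimates[u].theta_hat - estimates[v].theta_hat)
            if gap <= estimates[u].radius(alpha) + estimates[v].radius(alpha):
                cluster.add(v)
        assigned |= cluster
        clusters.append(cluster)
    return clusters

Once groups are detected, observations from any member of a group can be pooled to sharpen a shared parameter estimate, which is the usual payoff of clustering users in bandit settings.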
In predictive machine learning, unsupervised learning is applied when the labels of the da...
We consider running multiple instances of multi-armed bandit (MAB) problems in parallel. A main moti...
Contextual multi-armed bandit (MAB) algorithms have been shown promising for maximizing cumulative r...
In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) proble...
Multi-armed bandit problems are receiving a great deal of attention because they adequately formaliz...
Classical collaborative filtering and content-based filtering methods try to learn a static recomme...
We propose algorithms based on a multi-level Thompson sampling scheme, for the stochastic multi-arme...
Stochastic bandit algorithms are increasingly being used in the domain of recommender systems, when ...
We introduce a novel algorithmic approach to content recommendation based on adaptive clustering of ...
Multi-armed bandits (MAB) provide a principled online learning approach to attain the balance betwee...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
We consider a new setting of online clustering of contextual cascading bandits, an online learning p...
Master of Science, Department of Computer Science, William H. Hsu. This work compares two methods, the mul...
The multi-armed bandit (MAB) framework is a widely used sequential decision-making framework in which a ...