New England

Michal Valko
Inria Lille
Branislav Kveton
Shipra Agrawal

Publication date

January 2016

Abstract

Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this paper, we study a bandit problem where the payoffs of arms are smooth on a graph. This framework is suitable for solving online learn-ing problems that involve graphs, such as content-based rec-ommendation. In this problem, each recommended item is a node and its expected rating is similar to its neighbors. The goal is to recommend items that have high expected ratings. We aim for the algorithms where the cumulative regret would not scale poorly with the number of nodes. In particular, we introduce the notion of an effective dimension, which is small in real-world graphs, and propose two algorithms for solv-ing our problem that scale linearl...

Extracted data

We use cookies to provide a better user experience.

Data Protection

New England

Abstract

Extracted data

New England

Abstract

Extracted data

Topics

Related items

Topics

Related items