International audienceSmooth functions on graphs have wide applications in manifold and semi-supervised learning. In this paper, we study a bandit problem where the payoffs of arms are smooth on a graph. This framework is suitable for solving online learning problems that involve graphs, such as content-based recommendation. In this problem, each item we can recommend is a node and its expected rating is similar to its neighbors. The goal is to recommend items that have high expected ratings. We aim for the algorithms where the cumulative regret with respect to the optimal policy would not scale poorly with the number of nodes. In particular, we introduce the notion of an effective dimension, which is small in real-world graphs, and propose...
We study a type of recommendation systems problem, in which the system must be able to cover as many...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
We investigate the structural properties of certain sequential decision-making problems with limited...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
International audienceWe study a graph bandit setting where the objective of the learner is to detec...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
International audienceThompson Sampling (TS) has surged a lot of interest due to its good empirical ...
In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) proble...
With the rapid growth in velocity and volume, streaming data compels decision support systems to pre...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
Abstract Most recommender systems recommend a list of items. The user examines the list, from the fi...
We study a type of recommendation systems problem, in which the system must be able to cover as many...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
We investigate the structural properties of certain sequential decision-making problems with limited...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
International audienceWe study a graph bandit setting where the objective of the learner is to detec...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
International audienceThompson Sampling (TS) has surged a lot of interest due to its good empirical ...
In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) proble...
With the rapid growth in velocity and volume, streaming data compels decision support systems to pre...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
Abstract Most recommender systems recommend a list of items. The user examines the list, from the fi...
We study a type of recommendation systems problem, in which the system must be able to cover as many...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
We investigate the structural properties of certain sequential decision-making problems with limited...