International audienceSmooth functions on graphs have wide applications in manifold and semi-supervised learning. In this paper, we study a bandit problem where the payoffs of arms are smooth on a graph. This framework is suitable for solving online learning problems that involve graphs, such as content-based recommendation. In this problem, each item we can recommend is a node and its expected rating is similar to its neighbors. The goal is to recommend items that have high expected ratings. We aim for the algorithms where the cumulative regret with respect to the optimal policy would not scale poorly with the number of nodes. In particular, we introduce the notion of an effective dimension, which is small in real-world graphs, and propose...
We study a type of recommendation systems problem, in which the system must be able to cover as many...
We investigate the structural properties of certain sequential decision-making problems with limited...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
International audienceThompson Sampling (TS) has surged a lot of interest due to its good empirical ...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
With the rapid growth in velocity and volume, streaming data compels decision support systems to pre...
International audienceWe study a graph bandit setting where the objective of the learner is to detec...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) proble...
Thompson Sampling (TS) has surged a lot of interest due to its good empirical performance, in partic...
We study a type of recommendation systems problem, in which the system must be able to cover as many...
We investigate the structural properties of certain sequential decision-making problems with limited...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
International audienceThompson Sampling (TS) has surged a lot of interest due to its good empirical ...
We consider stochastic sequential learning problems where the learner can observe the \textit{averag...
With the rapid growth in velocity and volume, streaming data compels decision support systems to pre...
International audienceWe study a graph bandit setting where the objective of the learner is to detec...
International audienceWe consider stochastic sequential learning problems where the learner can obse...
In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) proble...
Thompson Sampling (TS) has surged a lot of interest due to its good empirical performance, in partic...
We study a type of recommendation systems problem, in which the system must be able to cover as many...
We investigate the structural properties of certain sequential decision-making problems with limited...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...