Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this paper, we study a bandit problem where the payoffs of arms are smooth on a graph. This framework is suitable for solving online learn-ing problems that involve graphs, such as content-based rec-ommendation. In this problem, each recommended item is a node and its expected rating is similar to its neighbors. The goal is to recommend items that have high expected ratings. We aim for the algorithms where the cumulative regret would not scale poorly with the number of nodes. In particular, we introduce the notion of an effective dimension, which is small in real-world graphs, and propose two algorithms for solv-ing our problem that scale linearl...
A large number of online services provide automated recommendations to help users to navigate throug...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
Smooth functions on graphs have wide applications in manifold and semi-supervised learning. In this ...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
With the rapid growth in velocity and volume, streaming data compels decision support systems to pre...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) proble...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
Multi-Armed Bandit (MAB) framework has been successfully applied in many web applications. However, ...
We study recommendation in scenarios where there's no prior information about the quality of content...
We perform online interactive recommendation via a factorization-based bandit algorithm. Low-rank ma...
A large number of online services provide automated recommendations to help users to navigate throug...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...
Smooth functions on graphs have wide applications in man-ifold and semi-supervised learning. In this...
Smooth functions on graphs have wide applications in manifold and semi-supervised learning. In this ...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
International audienceSmooth functions on graphs have wide applications in manifold and semi-supervi...
With the rapid growth in velocity and volume, streaming data compels decision support systems to pre...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We study the problem of decision-making under uncertainty in the bandit setting. This thesis goes be...
In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) proble...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
Multi-Armed Bandit (MAB) framework has been successfully applied in many web applications. However, ...
We study recommendation in scenarios where there's no prior information about the quality of content...
We perform online interactive recommendation via a factorization-based bandit algorithm. Low-rank ma...
A large number of online services provide automated recommendations to help users to navigate throug...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
International audienceThis paper addresses the on-line recommendation problem facing new users and n...