We study the interplay between feedback and communication in a cooperative online learning setting where a network of agents solves a task in which the learners' feedback is determined by an arbitrary graph. We characterize regret in terms of the independence number of the strong product between the feedback graph and the communication network. Our analysis recovers as special cases many previously known bounds for distributed online learning with either expert or bandit feedback. A more detailed version of our results also captures the dependence of the regret on the delay caused by the time the information takes to traverse each graph. Experiments run on synthetic data show that the empirical behavior of our algorithm is consistent with t...
International audienceWe consider the problem of asynchronous online combinatorial optimization on a...
International audienceWe consider a game-theoretical multi-agent learning problem where the feedback...
International audienceWe consider a model of game-theoretic learning based on online mirro...
We study networks of communicating learning agents that cooperate to solve a common nonstochastic ba...
We study an asynchronous online learning setting with a network of agents. At each time step, some o...
We study networks of communicating learning agents that cooperate to solve a common nonstochastic ba...
We introduce and study a partial-information model of online learning, where a decision maker repeat...
We consider distributed online learning for joint regret with communication constraints. In this se...
We consider a collaborative online learning paradigm, wherein a group of agents connected through a ...
International audienceIn this paper, we provide a general framework for studying multi-agent online ...
Abstract We consider a sequential learning problem with Gaussian payoffs and side observations: afte...
This study considers online learning with general directed feedback graphs. For this problem, we pre...
The framework of feedback graphs is a generalization of sequential decision-making with bandit or fu...
Abstract—In many types of multi-agent systems, distributed agents cooperate with each other to take ...
This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit feedb...
International audienceWe consider the problem of asynchronous online combinatorial optimization on a...
International audienceWe consider a game-theoretical multi-agent learning problem where the feedback...
International audienceWe consider a model of game-theoretic learning based on online mirro...
We study networks of communicating learning agents that cooperate to solve a common nonstochastic ba...
We study an asynchronous online learning setting with a network of agents. At each time step, some o...
We study networks of communicating learning agents that cooperate to solve a common nonstochastic ba...
We introduce and study a partial-information model of online learning, where a decision maker repeat...
We consider distributed online learning for joint regret with communication constraints. In this se...
We consider a collaborative online learning paradigm, wherein a group of agents connected through a ...
International audienceIn this paper, we provide a general framework for studying multi-agent online ...
Abstract We consider a sequential learning problem with Gaussian payoffs and side observations: afte...
This study considers online learning with general directed feedback graphs. For this problem, we pre...
The framework of feedback graphs is a generalization of sequential decision-making with bandit or fu...
Abstract—In many types of multi-agent systems, distributed agents cooperate with each other to take ...
This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit feedb...
International audienceWe consider the problem of asynchronous online combinatorial optimization on a...
International audienceWe consider a game-theoretical multi-agent learning problem where the feedback...
International audienceWe consider a model of game-theoretic learning based on online mirro...