Consider the following game: There is a fixed set V of n items. At each step an adversary chooses a score function st: V 7 → [0, 1], a learner out-puts a ranking of V, and then st is revealed. The learner’s loss is the sum over v ∈ V, of st(v) times v’s position (0th, 1st, 2nd,...) in the rank-ing. This problem captures, for example, on-line systems that iteratively present ranked lists of items to users, who then respond by choosing one (or more) sought items. The loss measures the users ’ burden, which increases the further the sought items are from the top. It also captures a version of online rank aggregation. We present an algorithm of expected regret O(n OPT + n2), where OPT is the loss of the best (single) ranking in hindsight. This ...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We study how to adapt to smoothly-varying (‘easy’) environments in well-known online learning proble...
Learning algorithms are now routinely applied to data aggregated from millions of untrusted users, i...
We consider an online learning framework where the task is to predict a permutation which represents...
We consider a setting where a system learns to rank a fixed set of m items. The goal is produce a go...
We consider prediction with expert advice when the loss vectors are assumed to lie in a set describ...
We study a general class of learning algorithms, which we call regret-matching algorithms, along wit...
We study online aggregation of the predictions of experts, and first show new second-order regret bo...
Abstract We study online aggregation of the predictions of experts, and first show new second-order ...
Much of the work in online learning focuses on the study of sublinear upper bounds on the regret. In...
We study a fundamental model of online preference aggregation, where an algorithm maintains an order...
We consider distributed online learning protocols that control the exchange of in-formation between ...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
We study the problem of online multiclass classification in a setting where the learner’s feedback i...
We study a new class of online learning problems where each of the online algorithm’s actions is ass...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We study how to adapt to smoothly-varying (‘easy’) environments in well-known online learning proble...
Learning algorithms are now routinely applied to data aggregated from millions of untrusted users, i...
We consider an online learning framework where the task is to predict a permutation which represents...
We consider a setting where a system learns to rank a fixed set of m items. The goal is produce a go...
We consider prediction with expert advice when the loss vectors are assumed to lie in a set describ...
We study a general class of learning algorithms, which we call regret-matching algorithms, along wit...
We study online aggregation of the predictions of experts, and first show new second-order regret bo...
Abstract We study online aggregation of the predictions of experts, and first show new second-order ...
Much of the work in online learning focuses on the study of sublinear upper bounds on the regret. In...
We study a fundamental model of online preference aggregation, where an algorithm maintains an order...
We consider distributed online learning protocols that control the exchange of in-formation between ...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
We study the problem of online multiclass classification in a setting where the learner’s feedback i...
We study a new class of online learning problems where each of the online algorithm’s actions is ass...
We provide a general mechanism to design online learning algorithms based on a minimax analysis with...
We study how to adapt to smoothly-varying (‘easy’) environments in well-known online learning proble...
Learning algorithms are now routinely applied to data aggregated from millions of untrusted users, i...