International audienceMachine learning progress relies on algorithm benchmarks. We study the problem of declaring a winner, or ranking "candidate" algorithms, based on results obtained by "judges" (scores on various tasks). Inspired by social science and game theory on fair elections, we compare various ranking functions, ranging from simple score averaging to Condorcet methods. We devise novel empirical criteria to assess the quality of ranking functions, including the generalization to new tasks and the stability under judge or candidate perturbation. We conduct an empirical comparison on the results of 5 competitions and benchmarks (one artificially generated). While prior theoretical analyses indicate that no single ranking function sat...
This article presents a new model for scoring alternatives from “contest ” outcomes. The model is a ...
AbstractWhen selecting the best contender based on relative rankings of k contenders, the definition...
In Machine Learning, a benchmark refers to an ensemble of datasets associated with one or multiple m...
International audienceMachine learning progress relies on algorithm benchmarks. We study the problem...
International audienceWe address the problem of selecting a winning algorithm in a challenge or benc...
Learning to rank is an increasingly important scientific field that comprises the use of machine lea...
Many scenarios in our daily life require us to infer some ranking over items or people based on limi...
Algorithmic decision-making has become ubiquitous in our societal and economic lives. With more and ...
In peer selection a group of agents must choose a subset of themselves, as winners for, e.g., peer-r...
In peer selection a group of agents must choose a subset of themselves, as winners for, e.g., peer-r...
Today, ranking is the de facto way that information is presented to users in automated systems, whic...
Whereas benchmarking experiments are very frequently used to investigate the perfor-mance of statist...
The algorithm selection problem is to choose the most suitable algorithm for solving a given problem...
Preference aggregation is the process of combining multiple preferences orders into one global ranki...
This article presents a new model for scoring alternatives from “contest ” outcomes. The model is a ...
AbstractWhen selecting the best contender based on relative rankings of k contenders, the definition...
In Machine Learning, a benchmark refers to an ensemble of datasets associated with one or multiple m...
International audienceMachine learning progress relies on algorithm benchmarks. We study the problem...
International audienceWe address the problem of selecting a winning algorithm in a challenge or benc...
Learning to rank is an increasingly important scientific field that comprises the use of machine lea...
Many scenarios in our daily life require us to infer some ranking over items or people based on limi...
Algorithmic decision-making has become ubiquitous in our societal and economic lives. With more and ...
In peer selection a group of agents must choose a subset of themselves, as winners for, e.g., peer-r...
In peer selection a group of agents must choose a subset of themselves, as winners for, e.g., peer-r...
Today, ranking is the de facto way that information is presented to users in automated systems, whic...
Whereas benchmarking experiments are very frequently used to investigate the perfor-mance of statist...
The algorithm selection problem is to choose the most suitable algorithm for solving a given problem...
Preference aggregation is the process of combining multiple preferences orders into one global ranki...
This article presents a new model for scoring alternatives from “contest ” outcomes. The model is a ...
AbstractWhen selecting the best contender based on relative rankings of k contenders, the definition...
In Machine Learning, a benchmark refers to an ensemble of datasets associated with one or multiple m...