In Machine Learning, a benchmark refers to an ensemble of datasets associated with one or multiple metrics together with a way to aggregate different systems performances. They are instrumental in (i) assessing the progress of new methods along different axes and (ii) selecting the best systems for practical use. This is particularly the case for NLP with the development of large pre-trained models (e.g. GPT, BERT) that are expected to generalize well on a variety of tasks. While the community mainly focused on developing new datasets and metrics, there has been little interest in the aggregation procedure, which is often reduced to a simple average over various performance measures. However, this procedure can be problematic when the metri...
We present a cross-benchmark comparison of learning-to-rank methods using two evaluation measures: t...
The availability of a huge mass of textual data in electronic format has increased the need for fast...
International audienceThe availability of a huge mass of textual data in electronic format has incre...
In Machine Learning, a benchmark refers to an ensemble of datasets associatedwith one or multiple me...
Learning to rank is an increasingly important scientific field that comprises the use of machine lea...
Whereas benchmarking experiments are very frequently used to investigate the perfor-mance of statist...
Benchmark experiments nowadays are the method of choice to evaluate learn-ing algorithms in most res...
Ranking problems are ubiquitous and occur in a variety of domains that include social choice, inform...
International audienceMachine learning progress relies on algorithm benchmarks. We study the problem...
The problem of ranking arises ubiquitously in almost every aspect of life, and in particular in Mach...
In this thesis, we discuss two issues in the learning to rank area, choosing effective objective lo...
We present methods to answer two basic questions that arise when benchmarking optimization algorithm...
Evaluations in machine learning rarely use the latest metrics, datasets, or human evaluation in favo...
We demonstrate that there are machine learning algorithms that can achieve success for two separate ...
We present a cross-benchmark comparison of learning-to-rank methods using two evaluation measures: t...
The availability of a huge mass of textual data in electronic format has increased the need for fast...
International audienceThe availability of a huge mass of textual data in electronic format has incre...
In Machine Learning, a benchmark refers to an ensemble of datasets associatedwith one or multiple me...
Learning to rank is an increasingly important scientific field that comprises the use of machine lea...
Whereas benchmarking experiments are very frequently used to investigate the perfor-mance of statist...
Benchmark experiments nowadays are the method of choice to evaluate learn-ing algorithms in most res...
Ranking problems are ubiquitous and occur in a variety of domains that include social choice, inform...
International audienceMachine learning progress relies on algorithm benchmarks. We study the problem...
The problem of ranking arises ubiquitously in almost every aspect of life, and in particular in Mach...
In this thesis, we discuss two issues in the learning to rank area, choosing effective objective lo...
We present methods to answer two basic questions that arise when benchmarking optimization algorithm...
Evaluations in machine learning rarely use the latest metrics, datasets, or human evaluation in favo...
We demonstrate that there are machine learning algorithms that can achieve success for two separate ...
We present a cross-benchmark comparison of learning-to-rank methods using two evaluation measures: t...
The availability of a huge mass of textual data in electronic format has increased the need for fast...
International audienceThe availability of a huge mass of textual data in electronic format has incre...