Abstract. Experimental evaluation and comparison of techniques, algo-rithms, approaches or complete systems is a crucial requirement to assess the practical impact of research results. The quality of published exper-imental results is usually limited due to several reasons such as: limited time, unavailability of standard benchmarks or shortage of computing resources. Moreover, achieving an independent, consistent, complete and insightful assessment for different alternatives in the same domain is a time and resource consuming task in addition to its requirement to be periodically repeated to maintain its freshness and being up-to-date. In this paper, we coin the notion of Liquid Benchmarks as online and public services that provide collabo...
This note marshals arguments for three points. First, it is better to test on small benchmark insta...
No existing evaluation infrastructure for shared tasks currently supports both reproducible on- and ...
Laboratory workflows and preclinical models have become increasingly diverse and complex. Confronted...
Abstract—Performances evaluation, benchmarking and re-producibility represent significant aspects fo...
Traditionally, peer-review focuses on the evaluation of scientific publications, literature products...
In this paper we review several novel approaches for research evaluation. We start with a brief over...
In the summary of the project as well as in the overall description for the SUCCESS project it is st...
One of the primary methods employed by researchers to judge the merits of new heuristics and algorit...
Website benchmarking approaches within service organisations continue to vary, and a majority of res...
One of the primary methods employed by researchers to judge the merits of new heuristics and algorit...
<div>The emergence of large-scale Internet applications and services has driven a surge in research ...
This paper argues that the common practice of benchmarking is inadequate as a scientific evaluation ...
Laboratory workflows and preclinical models have become increasingly diverse and complex. Confronted...
Benchmarking is a community-based and (preferably) community-driven activity involving consensus-bas...
With the increased interest in computational sciences, machine learning (ML), pattern recognition (P...
This note marshals arguments for three points. First, it is better to test on small benchmark insta...
No existing evaluation infrastructure for shared tasks currently supports both reproducible on- and ...
Laboratory workflows and preclinical models have become increasingly diverse and complex. Confronted...
Abstract—Performances evaluation, benchmarking and re-producibility represent significant aspects fo...
Traditionally, peer-review focuses on the evaluation of scientific publications, literature products...
In this paper we review several novel approaches for research evaluation. We start with a brief over...
In the summary of the project as well as in the overall description for the SUCCESS project it is st...
One of the primary methods employed by researchers to judge the merits of new heuristics and algorit...
Website benchmarking approaches within service organisations continue to vary, and a majority of res...
One of the primary methods employed by researchers to judge the merits of new heuristics and algorit...
<div>The emergence of large-scale Internet applications and services has driven a surge in research ...
This paper argues that the common practice of benchmarking is inadequate as a scientific evaluation ...
Laboratory workflows and preclinical models have become increasingly diverse and complex. Confronted...
Benchmarking is a community-based and (preferably) community-driven activity involving consensus-bas...
With the increased interest in computational sciences, machine learning (ML), pattern recognition (P...
This note marshals arguments for three points. First, it is better to test on small benchmark insta...
No existing evaluation infrastructure for shared tasks currently supports both reproducible on- and ...
Laboratory workflows and preclinical models have become increasingly diverse and complex. Confronted...