This article shows how small modifications to a benchmark program's source code or compilation options can change its behavior on a specific machine. It argues that, when evaluating machines, benchmark providers and users should take care to ensure reproducibility of results based on the machine code actually running on the hardware, not just the source code. The article uses color-to-grayscale conversion of digital images as a running example.
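As a rough illustration of the kind of small source-level modification the abstract refers to, the sketch below writes a color-to-grayscale conversion in two ways. It is not taken from the article: the function names are hypothetical, the ITU-R BT.601 luma weights are an assumption about which conversion is meant, and Python is used only for brevity, whereas the article's concern is the machine code a compiler produces.

```python
# Two hypothetical variants of one grayscale conversion (not the article's code).
# Assumption: ITU-R BT.601 luma weights 0.299, 0.587, 0.114.

def gray_float(r, g, b):
    """Weighted sum in floating point, rounded to the nearest integer."""
    return int(round(0.299 * r + 0.587 * g + 0.114 * b))


def gray_fixed(r, g, b):
    """Same weights scaled to integers (77 + 150 + 29 = 256), then a shift by 8."""
    return (77 * r + 150 * g + 29 * b) >> 8


# A few sample pixels; pure green is one input where the two variants disagree.
pixels = [(255, 255, 255), (0, 0, 0), (255, 0, 0), (0, 255, 0)]
differing = [p for p in pixels if gray_float(*p) != gray_fixed(*p)]
print(differing)
```

Both variants read as "the same benchmark" at the source level, yet they perform different arithmetic and do not always agree on the result, which is one reason a reproducible comparison would need to pin down more than the algorithm's name.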
12 pages. The community of program optimisation and analysis, code performance evaluation, parallelisa...
This is a supporting code artifact for the publication: "CompilerGym: Robust, Performant Compiler Op...
The decision which hardware platform to use for a certain application is an important problem in com...
Historically, benchmarks have been used for commercial purposes. A customer develops or selects a be...
Benchmarking is an essential step in the development of computational tools. We take this o...
Application benchmarking is a widely trusted method of performance evaluation....
This note marshals arguments for three points. First, it is better to test on small benchmark insta...
Industry vendors hesitate to disseminate proprietary applications to academia and third party vendor...
Benchmarks that closely match the behavior of production workloads are crucial to design and provisi...
The authors discuss the important questions that benchmarking must answer and the degree to which su...
There have been several papers published relating to the practice of benchmarking in machine learnin...
Benchmarks set standards for innovation in computer architecture research and industry product devel...
Numerical validation is at the core of machine learning research as it allows to assess the actual i...
Standard benchmarking provides the run times for given programs on given machines, but fails to prov...
One of the primary methods employed by researchers to judge the merits of new heuristics and algorit...