pre-printReproducibility, the ability to repeat program executions with the same numerical result or code behavior, is crucial for computational science and engineering applications. However, non-determinism in concurrency scheduling often hampers achieving this ability on high performance computing (HPC) systems. To aid in managing the adverse effects of non-determinism, prior work has provided techniques to achieve bit-precise reproducibility, but most of them focus only on small-scale parallelism. While scalable techniques recently emerged, they are disparate and target special purposes, e.g., single-schedule domains. On current systems with O(106) compute cores and future ones with O(109), any technique that does not embrace a unied, ta...
In an era where we can not afford to checkpoint frequently, replication is a generic way forward to ...
Constant reduction in the size of transistors has made it possible to implement many cores on a sing...
We present QuickStep, a novel system for parallelizing sequential programs. Unlike standard parallel...
The advent of multicores mandates parallel programming. While parallelism presents a panoply of prob...
Abstract—Bit-reproducibility has many advantages in the context of high-performance computing. Besid...
Our accelerating computational demand and the rise of multicore hardware have made parallel programs...
On modern multi-core, many-core, and heterogeneous architectures, floating-point computations, espec...
International audienceQuestions whether numerical simulation is reproducible or not have been report...
Taufer, MichelaNon-determinism in high performance scientific applications has severe detri- mental ...
We examine the problem of providing a parallel programming model that guarantees deterministic seman...
Parallelism is often required for performance. In these situations an excess of non-determinism is h...
Multicore shared-memory architectures are becoming prevalent but bring many programming challenges. ...
In the multicore era, a major programming task will be to make programs more parallel. This is ted...
Numerical Reproducibility at Exascale (NRE2015) workshop held as part of the Supercomputing Conferen...
A defining challenge for research in computer science and engineering has been the ongoing quest for...
In an era where we can not afford to checkpoint frequently, replication is a generic way forward to ...
Constant reduction in the size of transistors has made it possible to implement many cores on a sing...
We present QuickStep, a novel system for parallelizing sequential programs. Unlike standard parallel...
The advent of multicores mandates parallel programming. While parallelism presents a panoply of prob...
Abstract—Bit-reproducibility has many advantages in the context of high-performance computing. Besid...
Our accelerating computational demand and the rise of multicore hardware have made parallel programs...
On modern multi-core, many-core, and heterogeneous architectures, floating-point computations, espec...
International audienceQuestions whether numerical simulation is reproducible or not have been report...
Taufer, MichelaNon-determinism in high performance scientific applications has severe detri- mental ...
We examine the problem of providing a parallel programming model that guarantees deterministic seman...
Parallelism is often required for performance. In these situations an excess of non-determinism is h...
Multicore shared-memory architectures are becoming prevalent but bring many programming challenges. ...
In the multicore era, a major programming task will be to make programs more parallel. This is ted...
Numerical Reproducibility at Exascale (NRE2015) workshop held as part of the Supercomputing Conferen...
A defining challenge for research in computer science and engineering has been the ongoing quest for...
In an era where we can not afford to checkpoint frequently, replication is a generic way forward to ...
Constant reduction in the size of transistors has made it possible to implement many cores on a sing...
We present QuickStep, a novel system for parallelizing sequential programs. Unlike standard parallel...