ATHAPASCAN-0 programs are executed by a network of communicating threads evolving dynamically. Within the same node, threads communicate through shared memory and synchronization primitives. Between two different nodes, threads communicate by message passing. Execution replay of ATHAPASCAN-0 programs addresses the non-determinism arising from synchronization races, from promiscuous messages received from non specified source and from the varying number of operations testing the completion of non blocking ATHAPASCAN-0 primitives. The execution replay mechanism is mainly control-base- d since, in addition to recording the results of test operations, only the order of accesses to synchronization functions and the order of arrival of promiscuou...
Replay of parallel execution is required by HPC debuggers and resilience mechanisms. Up-to-date, the...
Athapascan is a macro data-flow application programming interface (API) for asynchronous parallel pr...
Recent research in deterministic record-replayseeks to ease debugging, security, and fault tolerance...
Clusters of shared-memory symmetric multiprocessors are increasingly used for high performance...
In this paper we present an execution replay system for Athapascan, an MPI-based multi-threaded runt...
This thesis defines an execution model for parallel programs based on remote procedure calls compute...
Shared-memory parallel programs are inherently nondeterministic, making it difficult to diagnose rar...
The ability to reproduce a parallel execution is desirable for debugging and program reliability pur...
The debugging cycle is the most common methodology for finding and correcting errors in sequential p...
The debugging cycle is the most common methodology for finding and correcting errors in sequential p...
Debugging MIMD programs is often a delicate job. As a matter of fact, they can have different behavi...
Ability to replay a program’s execution on a multi-processor system can significantly help parallel ...
abstract: Debugging is a hard task. Debugging multi-threaded applications with their inherit non-det...
This paper presents a tool that enables programmers to use dynamic testing tools for de-bugging non-...
While a lot of work has been focused on design and programming of shared memory multi-core architect...
Replay of parallel execution is required by HPC debuggers and resilience mechanisms. Up-to-date, the...
Athapascan is a macro data-flow application programming interface (API) for asynchronous parallel pr...
Recent research in deterministic record-replayseeks to ease debugging, security, and fault tolerance...
Clusters of shared-memory symmetric multiprocessors are increasingly used for high performance...
In this paper we present an execution replay system for Athapascan, an MPI-based multi-threaded runt...
This thesis defines an execution model for parallel programs based on remote procedure calls compute...
Shared-memory parallel programs are inherently nondeterministic, making it difficult to diagnose rar...
The ability to reproduce a parallel execution is desirable for debugging and program reliability pur...
The debugging cycle is the most common methodology for finding and correcting errors in sequential p...
The debugging cycle is the most common methodology for finding and correcting errors in sequential p...
Debugging MIMD programs is often a delicate job. As a matter of fact, they can have different behavi...
Ability to replay a program’s execution on a multi-processor system can significantly help parallel ...
abstract: Debugging is a hard task. Debugging multi-threaded applications with their inherit non-det...
This paper presents a tool that enables programmers to use dynamic testing tools for de-bugging non-...
While a lot of work has been focused on design and programming of shared memory multi-core architect...
Replay of parallel execution is required by HPC debuggers and resilience mechanisms. Up-to-date, the...
Athapascan is a macro data-flow application programming interface (API) for asynchronous parallel pr...
Recent research in deterministic record-replayseeks to ease debugging, security, and fault tolerance...