Execution monitoring plays a central role in most software development tools for parallel and distributed computer systems. However, such monitoring may induce delays that corrupt event timing. In this paper we introduce a perturbation analysis-like algorithm that, given a safe timed Petri net model of the monitored software, can recover the uncorrupted event timings, i.e., those that would have been observed had the delays not been present. Monitoring conditions sufficient to ensure correct operation of the algorithm and examples illustrating its applicability to message-passing systems are also presented. This work is part of a larger effort aimed at identifying cost-effective software alternatives to custom hardware mon...
A fundamental problem with parallel program monitoring tools is the intrusiveness introduced by inst...
The need for increased computing capability and more diverse hardware with its ever more complex topo...
We propose a new class of profiler for distributed and heterogeneous systems. In these sys...
The complexity of tracking perturbations in discrete event dynamic systems (DEDS) depends on the sys...
This paper extends results concerning the recovery of accurate parallel program traces from corrupte...
A task which is a part of a real time application must not only perform a specific function but it m...
Determining the performance behavior of parallel computations requires some form of intrusive traci...
Assessing the correct behavior of a given system at run-time can be achieved b...
The use of an observer in Petri net control may significantly reduce the performance of the closed-l...
This thesis focuses on the notion of representative quality of software generated traces of messag...
Supercomputing is a key technological pillar of modern science and engineering, indispensable for so...
Monitoring real-time concurrent systems is a challenging task. In this paper we formulate (model-bas...
Tracing parallel programs to observe their performance introduces intrusion as the result...
This work concerns the tracking of timing perturbations in discrete event dynamic systems (DEDS) mod...
Fay is a flexible platform for the efficient collection, processing, and analysis of software execut...