The scaling of Silicon devices has exacerbated the unreliability of modern computer systems, and power constraints have necessitated the involvement of software in hardware error detection. Simultaneously, the multi-core revolution has impelled software to become parallel. Therefore, there is a compelling need to protect parallel programs from hardware errors. Parallel programs’ tasks have significant similarity in control data due to the use of high-level programming models. In this thesis, we propose BlockWatch to leverage the similarity in parallel program’s control data for detecting hardware errors. BlockWatch statically extracts the similarity among different threads of a parallel program and checks the similarity at runtime. We evalu...
Parallel computing is pervasive. The variety and number of parallel hardware architectures increase...
Multicore machines have become pervasive and, as a result, parallel programming has received renewed...
With speculative thread-level parallelization, codes that cannot be fully compiler-analyzed are aggr...
The scaling of Silicon devices has exacerbated the unreliability of modern computer systems, and pow...
Microprocessor error detection is increasingly important, as the number of transistors in modern sys...
A popular approach for producing parallel software is to de-velop a sequential version of an applica...
Hardware errors are on the rise with reducing chip sizes, and power constraints have necessitated th...
With the advent of multicores, there is demand for monitoring parallelprograms running on multicores...
Abstract. Writing multithreaded software for multicore computers con-fronts many developers with the...
Multicore machines have become pervasive and, as a result, parallel programming has received renewe...
Multicore machines have become pervasive and, as a result, parallel programming has received renewe...
© 2016 ACM. Relentless technology scaling has made transistors more vulnerable to soft, or transient...
International audienceTo efficiently exploit the resources of new many-core architectures, integrati...
The complexity of modern software makes it difficult to ship correct programs. Errors can cost money...
In this paper, a software behavior-based technique is presented to detect control-flow errors in mul...
Parallel computing is pervasive. The variety and number of parallel hardware architectures increase...
Multicore machines have become pervasive and, as a result, parallel programming has received renewed...
With speculative thread-level parallelization, codes that cannot be fully compiler-analyzed are aggr...
The scaling of Silicon devices has exacerbated the unreliability of modern computer systems, and pow...
Microprocessor error detection is increasingly important, as the number of transistors in modern sys...
A popular approach for producing parallel software is to de-velop a sequential version of an applica...
Hardware errors are on the rise with reducing chip sizes, and power constraints have necessitated th...
With the advent of multicores, there is demand for monitoring parallelprograms running on multicores...
Abstract. Writing multithreaded software for multicore computers con-fronts many developers with the...
Multicore machines have become pervasive and, as a result, parallel programming has received renewe...
Multicore machines have become pervasive and, as a result, parallel programming has received renewe...
© 2016 ACM. Relentless technology scaling has made transistors more vulnerable to soft, or transient...
International audienceTo efficiently exploit the resources of new many-core architectures, integrati...
The complexity of modern software makes it difficult to ship correct programs. Errors can cost money...
In this paper, a software behavior-based technique is presented to detect control-flow errors in mul...
Parallel computing is pervasive. The variety and number of parallel hardware architectures increase...
Multicore machines have become pervasive and, as a result, parallel programming has received renewed...
With speculative thread-level parallelization, codes that cannot be fully compiler-analyzed are aggr...