Large scientific code bases are often composed of several layers of runtime libraries implemented in multiple programming languages. In such situations, programmers often choose conservative synchronization patterns, leading to suboptimal performance. In this paper, we present context-sensitive dynamic optimizations that elide barriers that are redundant during program execution. In our technique, we perform data race detection alongside the program to identify redundant barriers in their calling contexts; after an initial learning phase, we start eliding all future instances of barriers occurring in the same calling context. We present an automatic on-the-fly optimization and a multi-pass guided optimization. We apply our techniques to NWChem - a 6 mi...
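The learn-then-elide idea in the abstract above can be illustrated with a minimal sketch. This is a toy model, not the paper's implementation: the class `BarrierElider`, the `learning_threshold` parameter, the representation of a calling context as a tuple of function names, and the `barrier_was_needed` verdict (standing in for the output of a data race detector) are all assumptions made for illustration.

```python
from collections import defaultdict


class BarrierElider:
    """Toy model of context-sensitive barrier elision (all names hypothetical)."""

    def __init__(self, learning_threshold=3):
        # Number of redundant observations required before a context is elided.
        self.learning_threshold = learning_threshold
        # Calling context -> number of barrier instances observed to be redundant.
        self.redundant_count = defaultdict(int)
        # Calling contexts in which a barrier was ever observed to be necessary.
        self.required = set()

    def should_execute(self, context):
        """Return True if the barrier in this calling context must still run."""
        if context in self.required:
            return True
        return self.redundant_count[context] < self.learning_threshold

    def record(self, context, barrier_was_needed):
        """Feed back the (assumed) race detector verdict for one executed barrier."""
        if barrier_was_needed:
            self.required.add(context)
        else:
            self.redundant_count[context] += 1


def run(elider, events):
    """Replay (context, barrier_was_needed) pairs and return the decisions taken."""
    decisions = []
    for context, needed in events:
        if elider.should_execute(context):
            decisions.append("execute")
            elider.record(context, needed)
        else:
            decisions.append("elide")
    return decisions


if __name__ == "__main__":
    solver_ctx = ("main", "solve", "sync")  # hypothetical calling context
    io_ctx = ("main", "write", "sync")      # hypothetical calling context
    trace = [(solver_ctx, False)] * 4 + [(io_ctx, True)] + [(io_ctx, False)] * 3
    print(run(BarrierElider(learning_threshold=2), trace))
```

In this sketch, the same barrier call site is treated differently depending on the call path that reached it: a context that was redundant throughout the learning phase is elided afterwards, while a context in which the barrier was ever needed keeps executing it.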
Thesis (Ph.D.)--University of Washington, 2014. Some researchers have proposed data-race exceptions to...
Parallel computers can provide impressive speedups, but unfortunately such speedups are difficult to...
We introduce ToleRace, a runtime system that allows programs to detect and even tolerate a...
Many coarse-grained, explicitly parallel programs execute in phases delimited by barriers to prese...
A fundamental problem in the analysis of parallel programs is to determine when two statements in a ...
In this paper we present a precise data race detection technique for distributed memory parallel pr...
Scientific applications are ideal candidates for the “heterogeneous computing” paradigm, in which pa...
There exist significant, well established code bases in the scientific computing and research commun...
Many parallel programs are written in SPMD style, i.e. by running the same sequential program on all...
With modern advancements in hardware and software technology scaling towards new limits, our compute...
Multicore architectures are an increasingly important technique used to achieve increased performanc...
A data race condition occurs when multiple processes of a parallel program with a shared memory spac...
Parallel computing is pervasive. The variety and number of parallel hardware architectures increase...
Thesis (Ph.D.)--University of Rochester. Dept. of Computer Science, 2012. Speculative parallelizatio...