Abstract—Statistical debugging identifies program behaviors that are highly correlated with failures. Traditionally, this ap-proach has been applied to desktop software on which it is effective in identifying the causes that underlie several difficult classes of bugs including: memory corruption, non-deterministic bugs, and bugs with multiple temporally-distant triggers. The domain of scientific computing offers a new target for this type of debugging. Scientific code is run at massive scales offering massive quantities of statistical feedback data. Data collection can scale well because it requires no communication between compute nodes. Unfortunately, existing statistical debugging techniques impose run-time overhead that is unsuitable fo...
There are few runtime tools for modestly sized computing systems, with 10^3 processors, and above th...
Debugging parallel programs is an order of magnitude more complex than sequential ones, and yet, mos...
AbstractRuntime verification of large-scale scientific codes is difficult because they often involve...
Statistical debugging identifies program behaviors that are highly correlated with failures. Tra...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
AbstractTraditional debuggers are of limited value for modern scientific codes that manipulate large...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
Relative debugging helps trace software errors by comparing two concurrent executions of a program -...
Contemporary parallel debuggers allow users to control more than one processing thread while support...
Developing correct and efficient software for large scale systems is a challenging task. Developers ...
Petascale platforms with O(10{sup 5}) and O(10{sup 6}) processing cores are driving advancements in ...
Petascale computers and computing systems have the potential to solve large-scale, data-intensive pr...
Today's largest systems have over 100,000 cores, with million-core systems expected over the next fe...
Runtime verification of large-scale scientific codes is difficult because they often involve thousan...
There are few runtime tools for modestly sized computing systems, with 10^3 processors, and above th...
Debugging parallel programs is an order of magnitude more complex than sequential ones, and yet, mos...
AbstractRuntime verification of large-scale scientific codes is difficult because they often involve...
Statistical debugging identifies program behaviors that are highly correlated with failures. Tra...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
AbstractTraditional debuggers are of limited value for modern scientific codes that manipulate large...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
Relative debugging helps trace software errors by comparing two concurrent executions of a program -...
Contemporary parallel debuggers allow users to control more than one processing thread while support...
Developing correct and efficient software for large scale systems is a challenging task. Developers ...
Petascale platforms with O(10{sup 5}) and O(10{sup 6}) processing cores are driving advancements in ...
Petascale computers and computing systems have the potential to solve large-scale, data-intensive pr...
Today's largest systems have over 100,000 cores, with million-core systems expected over the next fe...
Runtime verification of large-scale scientific codes is difficult because they often involve thousan...
There are few runtime tools for modestly sized computing systems, with 10^3 processors, and above th...
Debugging parallel programs is an order of magnitude more complex than sequential ones, and yet, mos...
AbstractRuntime verification of large-scale scientific codes is difficult because they often involve...