Traditional debuggers are of limited value for modern scientific codes that manipulate large complex data structures. Current parallel machines make this even more complicated, because the data structure may be distributed across processors, making it difficult to view/interpret and validate its contents. Therefore, many applications' developers resort to placing validation code directly in the source program. This paper discusses a novel debug-time assertion, called a "Statistical Assertion", that allows using extracted statistics instead of raw data to reason about large data structures, therefore help locating coding defects. In this paper, we present the design and implementation of an 'extendable' statistical-framework which executes t...
ARGE scientific codes are constantly evolving. Refine-ments in understanding physical phenomena resu...
Detecting and isolating bugs that arise only at high processor counts is a challenging task. Over a ...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
AbstractTraditional debuggers are of limited value for modern scientific codes that manipulate large...
Programming languages have advanced tremendously over the years, but program debuggers have hardly c...
Runtime verification of large-scale scientific codes is difficult because they often involve thousan...
Abstract—Statistical debugging identifies program behaviors that are highly correlated with failures...
Contemporary parallel debuggers allow users to control more than one processing thread while support...
AbstractRuntime verification of large-scale scientific codes is difficult because they often involve...
Petascale computers and computing systems have the potential to solve large-scale, data-intensive pr...
Relative debugging helps trace software errors by comparing two concurrent executions of a program -...
Debugging parallel programs is an order of magnitude more complex than sequential ones, and yet, mos...
Petascale platforms with O(10{sup 5}) and O(10{sup 6}) processing cores are driving advancements in ...
ARGE scientific codes are constantly evolving. Refine-ments in understanding physical phenomena resu...
Detecting and isolating bugs that arise only at high processor counts is a challenging task. Over a ...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
Traditional debuggers are of limited value for modern scientific codes that manipulate large complex...
AbstractTraditional debuggers are of limited value for modern scientific codes that manipulate large...
Programming languages have advanced tremendously over the years, but program debuggers have hardly c...
Runtime verification of large-scale scientific codes is difficult because they often involve thousan...
Abstract—Statistical debugging identifies program behaviors that are highly correlated with failures...
Contemporary parallel debuggers allow users to control more than one processing thread while support...
AbstractRuntime verification of large-scale scientific codes is difficult because they often involve...
Petascale computers and computing systems have the potential to solve large-scale, data-intensive pr...
Relative debugging helps trace software errors by comparing two concurrent executions of a program -...
Debugging parallel programs is an order of magnitude more complex than sequential ones, and yet, mos...
Petascale platforms with O(10{sup 5}) and O(10{sup 6}) processing cores are driving advancements in ...
ARGE scientific codes are constantly evolving. Refine-ments in understanding physical phenomena resu...
Detecting and isolating bugs that arise only at high processor counts is a challenging task. Over a ...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...