This article outlines the parallelization of an irregular grid application with SVM-Fortran. It describes the different optimizations and their effectiveness. The parallelization was much simplified by the performance analysis tool OPAL, a source code based tool for requesting and analyzing runtime performance data. Although shared memory parallelization is easier than distributed memory parallelization, understanding and eliminating the overhead from page faults is impossible without such a tool. It relates the page faults to the arrays and to the location in the source code. An area which is not supported by OPAL but where supporting tools are highly desirable, is the performance degradation due to low utilization of the on-chip cache
Writing applications for high performance computers is a challenging task. Although writing code by ...
Highly parallel machines needed to solve compute-intensive scientific applications are based on the ...
Programming distributed memory systems forces the user to handle the problem of data locality. With ...
Programming distributed memory parallel computers with message passing is often considered to be a d...
This article outlines the parallelization of an irregular grid application based on shared virtual m...
SVM-Fortran is a language extension of Fortran 77 developed by KFA for shared memory parallel progra...
By using the shared memory programming model it is possible to parallelize applications step by step...
Programming distributed memory multiprocessors requires program parallelization as well as program o...
This report outlines the design of a performance analysis environment for SVM-Fortran programs. SVM-...
Programming distributed memory multiprocessors requires program parallelization as well as program o...
This paper describes a performance tuning tool, named SVMview, for DSM-based parallel computers. SVM...
SVM-Fortran is a language designed to program highly parallel systems with a global address space. A...
Vienna Fortran, a machine-independent language extension to Fortran which allows the user to write p...
This paper presents a profiling tool that allows the programmer to identify the regions of the progr...
This deliverable gives a detailed language specification of a data-parallel programming language for...
Writing applications for high performance computers is a challenging task. Although writing code by ...
Highly parallel machines needed to solve compute-intensive scientific applications are based on the ...
Programming distributed memory systems forces the user to handle the problem of data locality. With ...
Programming distributed memory parallel computers with message passing is often considered to be a d...
This article outlines the parallelization of an irregular grid application based on shared virtual m...
SVM-Fortran is a language extension of Fortran 77 developed by KFA for shared memory parallel progra...
By using the shared memory programming model it is possible to parallelize applications step by step...
Programming distributed memory multiprocessors requires program parallelization as well as program o...
This report outlines the design of a performance analysis environment for SVM-Fortran programs. SVM-...
Programming distributed memory multiprocessors requires program parallelization as well as program o...
This paper describes a performance tuning tool, named SVMview, for DSM-based parallel computers. SVM...
SVM-Fortran is a language designed to program highly parallel systems with a global address space. A...
Vienna Fortran, a machine-independent language extension to Fortran which allows the user to write p...
This paper presents a profiling tool that allows the programmer to identify the regions of the progr...
This deliverable gives a detailed language specification of a data-parallel programming language for...
Writing applications for high performance computers is a challenging task. Although writing code by ...
Highly parallel machines needed to solve compute-intensive scientific applications are based on the ...
Programming distributed memory systems forces the user to handle the problem of data locality. With ...