The aim of the project is to develop a light-weight MPI profiling library that differentiates between local communication (intra-node or nearby nodes) and distant communication (> 1 hop) within the 3D torus of BlueGene sys-tems. The library should use the standard PMPI profiling interface and be as portable as possible. It involves the instrumentation in C- all the point to point MPI functions (MPI [ISB]Send and MPI [I]Recv) and communica-tor creation (to keep track of the actual MPI ranks). It is also necessary to identify a suitable trace file format, and developing a tool to parse/analyse the trace files with a high-level programming language (e.g. Python). The impact of process placement on torus will be assessed on systems such as B...
An MPI profiling library is a standard mechanism for intercepting MPI calls by applications. Profili...
The goal of this thesis is to develop a new Channel Interface device for the MPICH Implementation of...
Abstract. The BlueGene/L supercoputer, with 65,536 dual-processor compute nodes, was designed from t...
PN MPI extends the PMPI profiling interface to support multiple concurrent PMPI-based tools by enabl...
This paper presents an automatic counter instrumentation and pro ling module added to the MPI librar...
The need for intuitive parallel programming designs has grown with the rise of modern many-core proc...
Event tracing of parallel programs can provide valuable information about program performance. The d...
MPI is the de-facto standard for inter-node communication on HPC systems, and has been for the past ...
Abstract. The BlueGene/L computer uses system-on-a-chip integration and a highly scalable 65,536-nod...
Abstract. The BlueGene/L supercomputer will consist of 65,536 dual-processor compute nodes interconn...
An MPI profiling library is a standard mechanism for intercepting MPI calls by applications. Profili...
Abstract. Modern HEC systems, such as Blue Gene/P, rely on achiev-ing high-performance by using the ...
With processor speeds no longer doubling every 18-24 months owing to the exponential increase in pow...
MPI (Message Passing Interface) is a standard specification for message-passing libraries. mpich is ...
In this project we studied the practical use of the MPI message-passing interface in advanced distri...
An MPI profiling library is a standard mechanism for intercepting MPI calls by applications. Profili...
The goal of this thesis is to develop a new Channel Interface device for the MPICH Implementation of...
Abstract. The BlueGene/L supercoputer, with 65,536 dual-processor compute nodes, was designed from t...
PN MPI extends the PMPI profiling interface to support multiple concurrent PMPI-based tools by enabl...
This paper presents an automatic counter instrumentation and pro ling module added to the MPI librar...
The need for intuitive parallel programming designs has grown with the rise of modern many-core proc...
Event tracing of parallel programs can provide valuable information about program performance. The d...
MPI is the de-facto standard for inter-node communication on HPC systems, and has been for the past ...
Abstract. The BlueGene/L computer uses system-on-a-chip integration and a highly scalable 65,536-nod...
Abstract. The BlueGene/L supercomputer will consist of 65,536 dual-processor compute nodes interconn...
An MPI profiling library is a standard mechanism for intercepting MPI calls by applications. Profili...
Abstract. Modern HEC systems, such as Blue Gene/P, rely on achiev-ing high-performance by using the ...
With processor speeds no longer doubling every 18-24 months owing to the exponential increase in pow...
MPI (Message Passing Interface) is a standard specification for message-passing libraries. mpich is ...
In this project we studied the practical use of the MPI message-passing interface in advanced distri...
An MPI profiling library is a standard mechanism for intercepting MPI calls by applications. Profili...
The goal of this thesis is to develop a new Channel Interface device for the MPICH Implementation of...
Abstract. The BlueGene/L supercoputer, with 65,536 dual-processor compute nodes, was designed from t...