This work presents a general methodology for estimating the performance of an HPC workload when running on a future hardware architecture. It demonstrates the methodology by estimating the performance of a significant scientific application, the Gyrokinetic Toroidal Code (GTC), when executing on Sun’s proposed next-generation petascale computer architecture. For GTC, we identify the important phases of the iteration and perform low-level analysis that includes instruction tracing and component simulations of processor and memory systems. This low-level analysis is complemented with scalability estimates based on modeling MPI, OpenMP, and I/O activity in the code. The work’s approach permits accurate end-to-end performance projections f...
Large science projects rely on complex workflows to analyze terabytes or petabytes of data....
HPC application developers encounter significant challenges getting their codes to run correctly on ...
Big data is prevalent in HPC. Many HPC projects rely on complex workflows to analyze terab...
The Gyrokinetic Toroidal Code (GTC) (version 2) is a 3D particle-in-cell application developed at th...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
In this work, we discuss the porting to the GPU platform of the latest production version ...
The Gyrokinetic Toroidal Code (GTC) is a global, three-dimensional particle-in-cell application deve...
Conventional programming practices on multicore processors in high performance computing ar...
The Gyrokinetic Toroidal Code (GTC) uses the particle-in-cell method to efficiently simulate plasma ...
HPC systems and parallel applications are increasing in complexity. Therefore, the possibility of ...
Performance modeling, the science of understanding and predicting application performance, is import...
Tuning the performance of applications requires understanding the interactions between code and targ...
The gyrokinetic Particle-in-Cell (PIC) method is a critical computational tool enabling petascale fu...
This paper presents a performance modeling methodology that is faster than traditional cycle-accurat...
The next generation of supercomputers will feature a diverse mix of accelerator devices. The increas...