This online course organised in cooperation with NHR@FAU covers performance engineering approaches on the compute node level. Even application developers who are fluent in OpenMP and MPI often lack a good grasp of how much performance could at best be achieved by their code. This is because parallelism takes us only half the way to good performance. Even worse, slow serial code tends to scale very well, hiding the fact that resources are wasted. This course conveys the required knowledge to develop a thorough understanding of the interactions between software and hardware. This process must start at the core, socket, and node level, where the code gets executed that does the actual computational work. We introduce the basic architectural ...
Achieving high application performance depends on the combination of memory footprint, instruction m...
Many-core architectures face significant hurdles to successful adoption by ISVs, and ultimately, the...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
Description This course includes topics on code optimization for x86 platforms (Intel and AMD CPUs)...
This course covers techniques for improving the performance of parallel applications by optimising o...
Modern computing platforms provide unprecedented amounts of raw computational power. But significant...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
The end of Dennard scaling also brought an end to frequency scaling as a means to improve performanc...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
Data-intensive programs deal with big chunks of data and often contain compute-intensive characteris...
Maximizing the productivity of modern multicore and manycore chips requires optimizing parallelism a...
Performance, which defines multiple aspects of the quality of a software system, is one of the most ...
Performance, which defines multiple aspects of the quality of a software system, is one of the most ...
Achieving good performance on high-end computing systems is growing ever more challenging due to eno...
Achieving high application performance depends on the combination of memory footprint, instruction m...
Many-core architectures face significant hurdles to successful adoption by ISVs, and ultimately, the...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
Description This course includes topics on code optimization for x86 platforms (Intel and AMD CPUs)...
This course covers techniques for improving the performance of parallel applications by optimising o...
Modern computing platforms provide unprecedented amounts of raw computational power. But significant...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
The end of Dennard scaling also brought an end to frequency scaling as a means to improve performanc...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
Data-intensive programs deal with big chunks of data and often contain compute-intensive characteris...
Maximizing the productivity of modern multicore and manycore chips requires optimizing parallelism a...
Performance, which defines multiple aspects of the quality of a software system, is one of the most ...
Performance, which defines multiple aspects of the quality of a software system, is one of the most ...
Achieving good performance on high-end computing systems is growing ever more challenging due to eno...
Achieving high application performance depends on the combination of memory footprint, instruction m...
Many-core architectures face significant hurdles to successful adoption by ISVs, and ultimately, the...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...