Application performance on a high-performance parallel platform depends on a variety of factors, the most important being the performance of the high-speed interconnect and the compute node processor. The performance of the compute processor depends on how well the compiler optimizes for a given processor architecture and how well it optimizes the application's source code. An analysis of uni-processor and parallel performance using different AMD Opteron compilers on key SNL application codes is presented.
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
The number of transistors as well as the frequency of processors have followed Moore's law for the p...
Experience with commercial and research high-performance architectures has indicated that the compil...
Over the past 15 years, microprocessor performance has doubled approximately every 18 months throug...
Applying the right compiler optimizations to a particular program can have a significant impact on p...
Performance increase, in terms of faster execution and energy efficiency, is a never-ending research...
How to evaluate a computer’s performance is an important issue for engineers in the area of ...
Developing programs that fully utilize the available computing capabilities of the underlying hardwa...
Production compilers have achieved a high level of maturity in terms of generating efficient code. C...
Fully utilizing the potential of parallel architectures is known to be a challenging task. In the pa...
This course covers techniques for improving the performance of parallel applications by optimising o...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
This paper studies the overall system power variations of two multi-core architectures, an 8-core In...
A number of recently released numerical libraries including automatically tuned linear alge...
While parallel computing offers an attractive perspective for the future, developing efficient paral...