Abstract. The Sun UltraSparc T2+ processor was designed for throughput com-puting and thread level parallelism. In this paper we evaluate its suitability for computational science. A set of benchmarks representing typical building blocks of scientific applications and a real-world hybrid MPI/OpenMP code for ocean simulation are used for performance evaluation. Additionally we apply micro benchmarks to evaluate the performance of certain components (such as the mem-ory subsystem). To recognise the capabilities of the T2+ processor we com-pare its performance with the IBM POWER6 processor. While the UltraSparc T2+ is targeted on server workloads with high throughput requirements via low-frequency core design and massive chip multithreading ca...
Thesis: S.M., Massachusetts Institute of Technology, Department of Nuclear Science and Engineering, ...
There is a growing gap between the peak speed of parallel computing systems and the actual delivered...
There is a growing gap between the peak speed of parallel computing systems and the actual delivere...
Energy consumption is a major concern with high performance multicore systems. In this paper, we exp...
Recently, researchers at NASA Ames have defined a set of computational benchmarks designed to measur...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing...
In this paper we report on a set of benchmark results recently obtained by CERN openlab when compari...
This thesis deals with how to develop scientific computing software that runs efficiently on multico...
Recently a number of advanced architecture machines have become commercially available. These new ma...
Sun Microsystems recently announced a new line of high-performance Sun FireTM Midframe servers and S...
Many/multi-core supercomputers provide a natural programming paradigm for hybrid MPI/OpenMP scientif...
We theoretically and experimentally analyse the efficiency with which a wide range of important scie...
ABSTRACT. BlueGene/P (BG/P) is the second generation BlueGene architecture from IBM, succeeding Blue...
With POWER8 a new generation of POWER processors became available. This architecture features a mode...
The growing gap between sustained and peak performance for scientific applications has become a well...
Thesis: S.M., Massachusetts Institute of Technology, Department of Nuclear Science and Engineering, ...
There is a growing gap between the peak speed of parallel computing systems and the actual delivered...
There is a growing gap between the peak speed of parallel computing systems and the actual delivere...
Energy consumption is a major concern with high performance multicore systems. In this paper, we exp...
Recently, researchers at NASA Ames have defined a set of computational benchmarks designed to measur...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing...
In this paper we report on a set of benchmark results recently obtained by CERN openlab when compari...
This thesis deals with how to develop scientific computing software that runs efficiently on multico...
Recently a number of advanced architecture machines have become commercially available. These new ma...
Sun Microsystems recently announced a new line of high-performance Sun FireTM Midframe servers and S...
Many/multi-core supercomputers provide a natural programming paradigm for hybrid MPI/OpenMP scientif...
We theoretically and experimentally analyse the efficiency with which a wide range of important scie...
ABSTRACT. BlueGene/P (BG/P) is the second generation BlueGene architecture from IBM, succeeding Blue...
With POWER8 a new generation of POWER processors became available. This architecture features a mode...
The growing gap between sustained and peak performance for scientific applications has become a well...
Thesis: S.M., Massachusetts Institute of Technology, Department of Nuclear Science and Engineering, ...
There is a growing gap between the peak speed of parallel computing systems and the actual delivered...
There is a growing gap between the peak speed of parallel computing systems and the actual delivere...