The slowing pace of commodity microprocessor performance improvements combined with ever-increasing chip power demands has become of utmost concern to computational scientists. As a result, the high performance computing community is examining alternative architectures that address the limitations of modern cache-based designs. In this work, we examine the potential of using the recently-released STI Cell processor as a building block for future high-end computing systems. Our work contains several novel contributions. First, we introduce a performance model for Cell and apply it to several key scientific computing kernels: dense matrix multiply, sparse matrix vector multiply, stencil computations, and 1D/2D FFTs. The difficulty of programm...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
International audienceOn modern architectures, the performance of 32-bit operations is often at leas...
AbstractWe have designed Particle-in-Cell algorithms for emerging architectures. These algorithms sh...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing ...
In this work, we examine the potential of using the recently-released STI Cell processor as a buildi...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing ...
The STI CELL processor introduces pioneering solutions in processor architecture. At the same time i...
Stencil-based kernels constitute the core of many scientific applications on block-structured grids....
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Mainstream processor development is mostly targeted at compatibility and continuity. Thus, the proce...
Matrix factorization (or often called decomposition) is a frequently used kernel in a large number o...
The Cell Broadband Engine processor is a powerful processor capable of over 220 GFLOPS. It is highly...
This paper evaluates the performance of bioinformatics applications on the Cell Broadband Engine (Ce...
The call for ever-increasing model resolutions and physical processes in climate and weather models ...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
International audienceOn modern architectures, the performance of 32-bit operations is often at leas...
AbstractWe have designed Particle-in-Cell algorithms for emerging architectures. These algorithms sh...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing ...
In this work, we examine the potential of using the recently-released STI Cell processor as a buildi...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing ...
The STI CELL processor introduces pioneering solutions in processor architecture. At the same time i...
Stencil-based kernels constitute the core of many scientific applications on block-structured grids....
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Mainstream processor development is mostly targeted at compatibility and continuity. Thus, the proce...
Matrix factorization (or often called decomposition) is a frequently used kernel in a large number o...
The Cell Broadband Engine processor is a powerful processor capable of over 220 GFLOPS. It is highly...
This paper evaluates the performance of bioinformatics applications on the Cell Broadband Engine (Ce...
The call for ever-increasing model resolutions and physical processes in climate and weather models ...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
International audienceOn modern architectures, the performance of 32-bit operations is often at leas...
AbstractWe have designed Particle-in-Cell algorithms for emerging architectures. These algorithms sh...