The slowing pace of commodity microprocessor performance improvements combined with ever-increasing chip power demands has become of utmost concern to computational scientists. Therefore, the high performance computing community is examining alternative architectures that address the limitations of modern superscalar designs. In this work, we examine STI's forthcoming Cell processor: a novel, low-power architecture that combines a PowerPC core with eight independent SIMD processing units coupled with a software-controlled memory to offer high FLOP/s/Watt. Since neither Cell hardware nor cycle-accurate simulators are currently publicly available, we develop an analytic framework to predict Cell performance on dense and sparse matrix operatio...
The call for ever-increasing model resolutions and physical processes in climate and weather models ...
This paper presents the first deployment of the Fast Multipole Method on the C...
In recent years, scaling of single-core superscalar processor performance has slowed due to complex...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing...
The slowing pace of commodity microprocessor performance improvements combined with ever-increasing ...
In this work, we examine the potential of using the recently-released STI Cell processor as a buildi...
The Cell Broadband Engine (BE) Processor has a unique memory access architecture besides its powerful comp...
Developed for multimedia and game applications, as well as other numerically intensive workloads, th...
The Cell Broadband Engine processor is a powerful processor capable of over 220 GFLOPS. It is highly...
Industry is moving towards many-core processors, which are expected to consist of tens or e...
Matrix factorization (often called decomposition) is a frequently used kernel in a large number o...
Various processor architectures have been proposed to date, and the performance has im...
We are witnessing a dramatic change in computer architecture due to the multicore paradigm shift, as...
Mainstream processor development is mostly targeted at compatibility and continuity. Thus, the proce...