AbstractSparse scientific codes face grave performance challenges as memory bandwidth limitations grow on multi-core architectures. We investigate the memory behavior of a key sparse scientific kernel and study model-driven performance evaluation in this scope. We propose the Coupled Reuse-Cache Model (CRC Model), to enable multilevel cache performance analysis of parallel sparse codes. Our approach builds separate probabilistic application and hardware models, which are coupled to discover unprecedented insight into software-hardware interactions in the cache hierarchy. We evaluate our model's predictive performance with the pervasive sparse matrix-vector product kernel, using 1 to 16 cores and multiple cache configurations. For multi-core...
This research is part of a co-design project that has the goal of designing hardware syste...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
Performance is an important aspect of computer systems since it directly affects user experience. On...
AbstractSparse scientific codes face grave performance challenges as memory bandwidth limitations gr...
. Many scientific applications handle compressed sparse matrices. Cache behavior during the executio...
The context of this work are performance models of software systems, which are used for predicting p...
The study and understanding of memory hierarchy behavior is essential, as it is critical to current ...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
The potential for improving the performance of data-intensive scientific programs by enhancing data ...
We present a cache performance modeling methodology that facilitates the tuning of uniprocessor cach...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Scientific applications will have to scale to many thousands of processor cores to reach petascale. ...
While there are many studies on the locality of dense codes, few deal with the locality of sparse co...
While much work has been devoted to the study of cache behavior during the execution of codes with r...
Sparse matrices are in the kernel of numerical applications. Their compressed storage, which permits...
This research is part of a co-design project that has the goal of designing hardware syste...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
Performance is an important aspect of computer systems since it directly affects user experience. On...
AbstractSparse scientific codes face grave performance challenges as memory bandwidth limitations gr...
. Many scientific applications handle compressed sparse matrices. Cache behavior during the executio...
The context of this work are performance models of software systems, which are used for predicting p...
The study and understanding of memory hierarchy behavior is essential, as it is critical to current ...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
The potential for improving the performance of data-intensive scientific programs by enhancing data ...
We present a cache performance modeling methodology that facilitates the tuning of uniprocessor cach...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Scientific applications will have to scale to many thousands of processor cores to reach petascale. ...
While there are many studies on the locality of dense codes, few deal with the locality of sparse co...
While much work has been devoted to the study of cache behavior during the execution of codes with r...
Sparse matrices are in the kernel of numerical applications. Their compressed storage, which permits...
This research is part of a co-design project that has the goal of designing hardware syste...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
Performance is an important aspect of computer systems since it directly affects user experience. On...