The gap between CPU speed and memory speed in modern com-puter systems is widening as new generations of hardware are introduced. Loop blocking and prefetching transformations help bridge this gap for regular applications; however, these techniques aren’t as effective for irregular applications. This paper investi-gates using data and computation reordering to improve memory hierarchy utilization for irregular applications on systems with multi-level memory hierarchies. We evaluate the impact of data and computation reordering using space-filling curves and intro-duce multi-Ievel blocking as a new computation reordering strat-egy for irregular applications. In experiments that applied specific combinations of data and computation reordering...
Irregular applications frequently exhibit poor performance on contemporary computer architectures, i...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...
AbstractA current challenge for computer users is to fully exploit performance of new Multicore syst...
The gap between CPU speed and memory speed in modern computer systems is widening as new generations...
The trend in high-performance microprocessor design is toward increasing computational power on the ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
In order to mitigate the impact of the constantly widening gap between processor speed and main memo...
textModern computer systems spend a substantial fraction of their running time waiting for data from...
Irregular applications frequently exhibit poor performance on contemporary computer architectures, i...
We demonstrate that data reordering can substantially improve the performance of fine-grained irregu...
In computer systems, latency tolerance is the use of concurrency to achieve high performance in spit...
"What Mathematics is to Physics, Data traversal is to High-performance computing." The world of Comp...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
Journal ArticleAlthough microprocessor performance continues to increase at a rapid pace, the growin...
While many parallel applications exhibit good spatial locality, other important codes in areas like ...
Irregular applications frequently exhibit poor performance on contemporary computer architectures, i...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...
AbstractA current challenge for computer users is to fully exploit performance of new Multicore syst...
The gap between CPU speed and memory speed in modern computer systems is widening as new generations...
The trend in high-performance microprocessor design is toward increasing computational power on the ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
In order to mitigate the impact of the constantly widening gap between processor speed and main memo...
textModern computer systems spend a substantial fraction of their running time waiting for data from...
Irregular applications frequently exhibit poor performance on contemporary computer architectures, i...
We demonstrate that data reordering can substantially improve the performance of fine-grained irregu...
In computer systems, latency tolerance is the use of concurrency to achieve high performance in spit...
"What Mathematics is to Physics, Data traversal is to High-performance computing." The world of Comp...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
Journal ArticleAlthough microprocessor performance continues to increase at a rapid pace, the growin...
While many parallel applications exhibit good spatial locality, other important codes in areas like ...
Irregular applications frequently exhibit poor performance on contemporary computer architectures, i...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...
AbstractA current challenge for computer users is to fully exploit performance of new Multicore syst...