Modern microprocessor designs continue to obtain impressive performance gains through increasing clock rates and advances in the parallelism obtained via micro-architecture design. Unfortunately, corresponding improvements in memory design technology have not been realized, resulting in latencies of over 100 cycles between processors and main memory. This ever-increasing gap in speed has pushed the current memory-hierarchy approach to its limit. Traditional approaches to memory-hierarchy management have not yielded satisfactory results. Hardware solutions require more power and energy than desired and do not scale well. Compiler solutions tend to miss too many optimization opportunities because of limited compile-time knowledge of run-tim...
Journal Article: Although microprocessor performance continues to increase at a rapid pace, the growin...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
Programming languages that provide multidimensional arrays and a flat linear model of memory must im...
Modern microprocessor designs continue to obtain impressive performance gains through increasing clo...
The trend in high-performance microprocessor design is toward increasing computational power on the ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Abstract — In many computer systems, a large portion of the execution time and energy consumption is...
In order to mitigate the impact of the constantly widening gap between processor speed and main memo...
Application performance on modern microprocessors depends heavily on performance related characteris...
In modern computers, memory hierarchies play a paramount role in improving the average execution tim...
Computer memory is organized into a hierarchy. At the highest level are the processor registers, nex...
A tremendous amount of compiler research effort over the past ten years has been devoted to compensa...
New architectures for extreme-scale computing need to be designed for higher energy efficiency than ...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
Summarization: By examining the rate at which successive generations of processor and DRAM cycle tim...