The trend in high-performance microprocessor design is toward increasing computational power on the chip. Microprocessors can now process dramatically more data per machine cycle than previous models. Unfortunately, memory speeds have not kept pace. The result is an imbalance between computation speed and memory speed. This imbalance is leading machine designers to use more complicated memory hierarchies. In turn, programmers are explicitly restructuring codes to perform well on particular memory systems, leading to machine-specific programs. It is our belief that machine-specific programming is a step in the wrong direction. Compilers, not programmers, should handle machine-specific implementation details. To this end, this thesis develops...
Application performance on modern microprocessors depends heavily on performance related characteris...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...
The trend in high-performance microprocessor design is toward increasing computational power on the ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
In order to mitigate the impact of the constantly widening gap between processor speed and main memo...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
Modern microprocessor designs continue to obtain impressive performance gains through increasing clo...
Modern microprocessor designs continue to obtain impressive per-formance gains through increasing cl...
A tremendous amount of compiler research effort over the past ten years has been devoted to compensa...
Effective memory hierarchy utilization is critical to the performance of modern multiprocessor archi...
The performance of the memory hierarchy has become one of the most critical elements in the performa...
Programming languages that provide multidimensional arrays and a flat linear model of memory must im...
We present a simple and novel framework for generating blocked codes for high-performance machines w...
150 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1986.This thesis presents a new ap...
Application performance on modern microprocessors depends heavily on performance related characteris...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...
The trend in high-performance microprocessor design is toward increasing computational power on the ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
In order to mitigate the impact of the constantly widening gap between processor speed and main memo...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
Modern microprocessor designs continue to obtain impressive performance gains through increasing clo...
Modern microprocessor designs continue to obtain impressive per-formance gains through increasing cl...
A tremendous amount of compiler research effort over the past ten years has been devoted to compensa...
Effective memory hierarchy utilization is critical to the performance of modern multiprocessor archi...
The performance of the memory hierarchy has become one of the most critical elements in the performa...
Programming languages that provide multidimensional arrays and a flat linear model of memory must im...
We present a simple and novel framework for generating blocked codes for high-performance machines w...
150 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1986.This thesis presents a new ap...
Application performance on modern microprocessors depends heavily on performance related characteris...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
Processors have become faster at a much quicker rate than memory access time, creating wide gap betw...