The trend in high-performance microprocessor design is toward increasing computational power on the chip. Microprocessors can now process dramatically more data per machine cycle than previous models. Unfortunately, memory speeds have not kept pace. The result is an imbalance between computation speed and memory speed. This imbalance is leading machine designers to use more complicated memory hierarchies. In turn, programmers are explicitly restructuring codes to perform well on particular memory systems, leading to machine-specific programs. It is our belief that machine-specific programming is a step in the wrong direction. Compilers, not programmers, should handle machine-specific implementation details. To this end, this thesis develops...
In this paper we present a method for determining the cache performance of the loop nests in a progr...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
In order to mitigate the impact of the constantly widening gap between processor speed and main memo...
Modern microprocessor designs continue to obtain impressive performance gains through increasing clo...
Modern microprocessor designs continue to obtain impressive per-formance gains through increasing cl...
Many applications are memory intensive and thus are bounded by memory latency and bandwidth. While i...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
The performance of the memory hierarchy has become one of the most critical elements in the performa...
On modern computers, the performance of programs is often limited by memory latency rather than by p...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
Programming languages that provide multidimensional arrays and a flat linear model of memory must im...
A tremendous amount of compiler research effort over the past ten years has been devoted to compensa...
In this paper we present a method for determining the cache performance of the loop nests in a progr...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
In order to mitigate the impact of the constantly widening gap between processor speed and main memo...
Modern microprocessor designs continue to obtain impressive performance gains through increasing clo...
Modern microprocessor designs continue to obtain impressive per-formance gains through increasing cl...
Many applications are memory intensive and thus are bounded by memory latency and bandwidth. While i...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
The performance of the memory hierarchy has become one of the most critical elements in the performa...
On modern computers, the performance of programs is often limited by memory latency rather than by p...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
Programming languages that provide multidimensional arrays and a flat linear model of memory must im...
A tremendous amount of compiler research effort over the past ten years has been devoted to compensa...
In this paper we present a method for determining the cache performance of the loop nests in a progr...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...