Profiling can accurately analyze program behavior for select data inputs. We show that profiling can also predict program locality for inputs other than the profiled ones. Here locality is defined by the distance of data reuse. Studying whole-program data reuse may reveal global patterns not apparent in short-distance reuses or local control flow. However, the analysis must meet two requirements to be useful. The first is efficiency. It needs to analyze all accesses to all data elements in full-size benchmarks and to measure distance of any length and at any required precision. The second is prediction. Based on a few training runs, it needs to classify patterns as regular or irregular and, for regular ones, it should predict their (chang-i...
Emerging computer architectures will feature drastically decreased flops/byte ...
This paper presents a tool based on a new approach for analyzing the locality exhibited by data memo...
The growing memory wall requires that more attention is given to the data cache behavior of programs...
Profiling can effectively analyze program behavior and provide critical information for feedback-dir...
Profiling can effectively analyze program behavior and provide critical information for fe...
Cache is one of the most widely used components in today's computing systems. Its performance is hea...
Feedback-directed optimization has become an increasingly important tool in designing and building o...
Feedback-directed optimization has become an increasingly important tool in designing and building ...
Since a program may have an infinite number of inputs, it is difficult to measure the exact performa...
Emerging computer architectures will feature drastically decreased flops/byte (ratio of peak process...
Locality increasingly determines system performance. As a rigorous and precise locality model, reus...
Locality, characterized by data reuses, determines caching performance. Reuse distance (i.e. LRU st...
As multicore processors implementing shared-memory programming models have become commonplace, analy...
Due to the huge speed gaps in the memory hierarchy of modern computer architectures, it is important...
As computing efficiency becomes constrained by hardware scaling limitations, code optimization grows...