Data locality optimization is a well-known goal when handling programs that must run as fast as possible or use a minimum amount of energy. However, usual techniques never address the significant impact of the numerous stalled processor cycles that may occur when consecutive load and store instructions access the same memory location. We show that two versions of the same program may exhibit similar memory performance while differing greatly in execution time, because of the stalled processor cycles generated by many pipeline hazards. We propose a new programming structure called "xfor", enabling the explicit control of the way data locality is optimized in a program and thus, to control th...
Software prefetching and locality optimizations are techniques for overcoming the gap between proces...
Software prefetching and locality optimizations are two techniques for overcoming the speed gap betw...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
Cache memories were inv...
New trends such as the internet-of-things and smart homes push the demands for energy-efficiency. Ch...
The evolution of computer systems to continuously improve execution efficiency has traditionally emb...
Many data-intensive applications exhibit poor temporal and spatial locality and perform poorly on co...
Green computing techniques aim to reduce the power footprint of modern embedded devices with partic...
In memory hierarchies, programs can be sped up by increasing their degree of locality. This paper...
Memory accesses in modern processors are both far slower and vastly more energy-expensive than the a...
Superscalar processors contain large, complex structures to hold data and instructions as they wait ...
The gap between processor speed and memory latency has led to the use of caches in the memory system...
The end of Dennard scaling has brought energy savings to the forefront of processor design. When cou...