Adapting source code to the specifics of its host hardware is one way to implement software optimization. It lets the software benefit from processors that are primarily designed to improve system performance. To reach such a software/hardware fit without narrowing the scope of the optimization to a few executions, one needs relevant performance models of the considered hardware. This paper proposes a new method to optimize software kernels by considering their data-access mode. The proposed method makes it possible to build a data-cache-miss model of a given application with respect to its specific memory-access pattern. We apply our method in order to evaluate some custom implementations of matrix da...
This research is part of a co-design project that has the goal of designing hardware syste...
Data locality is central to modern computer designs. The widening gap between processor speed and me...
In this paper, we present a novel linear time algorithm for data remapping that is (i) ligh...
Cache memories were inv...
In this paper we propose an instruction to accelerate software caches. While DMAs are very efficient...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
Many data-intensive applications exhibit poor temporal and spatial locality and perform poorly on co...
With software applications increasing in complexity, the description of hardware is becoming increas...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
The central data structures for many applications in scientific computing are large multidimensional...
For many applications, cache misses are the primary performance bottleneck. Even though much researc...
Modern processors apply sophisticated techniques, such as deep cache hierarchies and hardware prefet...
Grantor: University of Toronto. The latency of accessing instructions and data from the memo...