We present a toolset to automatically optimize the cache efficiency of an arbitrary application by dynamically padding memory allocations. The toolset is also suitable to guide manual optimizations. Histograms are used to evaluate cache simulations of memory traces of the ap-plications considered. A general algorithm is presented that calculates optimized pad sets based on the informa-tion contained in the histograms. These pad sets can then be used to optimize further runs of the applications ex-amined. Experiments show that the cache hit rates of the modified applications are considerably increased
We describe novel techniques used for efficient simulation of memory in SimICS, an instruction leve...
Memory subsystem, in particular, cache design is important for both high performance and embedded co...
Abstract. In this paper, two tools are presented: an execution driven cache simulator which relates ...
The increasing use of microprocessor cores in embedded systems, as well as mobile and portable devic...
As multiprocessor systems-on-chip become a reality, perfor-mance modeling becomes a challenge. To qu...
This thesis presents a generic approach towards compiling fast execution-driven simulators, and appl...
The performance of a computing system heavily depends on the memory hierarchy. Fast but expensive ca...
Application-specific system-on-chip platforms create the opportunity to customize the cache configur...
This thesis studies the use of software methods to improve memory performance in a heterogeneous cac...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
The technological improvements in silicon manufacturing are yielding vast increases of processor &ap...
This paper presents a generic approach for compiling fast execution-driven simulators, and applies t...
Caches have become increasingly important with the widening gap between main memory and processor sp...
Application-specific system-on-chip platforms create the opportu-nity to customize the cache configu...
Abstract—An effectively designed and efficiently used memory hierarchy, composed of scratch-pads or ...
We describe novel techniques used for efficient simulation of memory in SimICS, an instruction leve...
Memory subsystem, in particular, cache design is important for both high performance and embedded co...
Abstract. In this paper, two tools are presented: an execution driven cache simulator which relates ...
The increasing use of microprocessor cores in embedded systems, as well as mobile and portable devic...
As multiprocessor systems-on-chip become a reality, perfor-mance modeling becomes a challenge. To qu...
This thesis presents a generic approach towards compiling fast execution-driven simulators, and appl...
The performance of a computing system heavily depends on the memory hierarchy. Fast but expensive ca...
Application-specific system-on-chip platforms create the opportunity to customize the cache configur...
This thesis studies the use of software methods to improve memory performance in a heterogeneous cac...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
The technological improvements in silicon manufacturing are yielding vast increases of processor &ap...
This paper presents a generic approach for compiling fast execution-driven simulators, and applies t...
Caches have become increasingly important with the widening gap between main memory and processor sp...
Application-specific system-on-chip platforms create the opportu-nity to customize the cache configu...
Abstract—An effectively designed and efficiently used memory hierarchy, composed of scratch-pads or ...
We describe novel techniques used for efficient simulation of memory in SimICS, an instruction leve...
Memory subsystem, in particular, cache design is important for both high performance and embedded co...
Abstract. In this paper, two tools are presented: an execution driven cache simulator which relates ...