Abstract—Bulk memory copying and initialization are among the most ubiquitous operations performed in current computer systems by both user applications and operating systems. While many current systems rely on a loop of loads and stores, there are proposals to introduce a single instruction to perform bulk memory copying. While such an instruction can improve performance by generating fewer TLB and cache accesses and requiring fewer pipeline resources, in this paper we show that the key to significantly improving performance is removing the pipeline and cache bottlenecks in the code that follows the instruction. We show that the bottlenecks arise due to (1) the pipeline clogged by the copying instruction, (2) lengthened critical path...
Abstract—As the performance gap between processors and main memory continues to widen, increasingly ...
Block memory operations are frequently performed by the operating system and consume an increasing f...
Current microprocessors incorporate techniques to exploit instruction-level parallelism...
Many programs initialize or copy large amounts of memory data. Initialization and copying are for...
This dissertation presents a hardware accelerator that is able to accelerate large (including non-pa...
Memory copies for bulk data transport incur large overheads due to CPU stalling, small register-size...
In this paper, we present a new architecture of the cache-based memory copy hardware accelerator in ...
An ideal high performance computer includes a fast processor and a multi-million byte memory of comp...
The growing complexity of modern computer architectures increasingly compli...
This paper presents a Least Popularly Used buffer cache algorithm to exploit both temporal locality ...
The memory system remains a major performance bottleneck in modern and future architectures. In this...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer's pr...
Numerical applications frequently contain nested loop structures that process large arrays of data. ...
The gap between CPU and main memory speeds has long been a performance bottleneck. As we move toward...
Execution efficiency of memory instructions remains critically important. To this end, a plethora of...