This work introduces a novel object-centric bank partition (OBP) to mitigate both the inter-thread and intra-thread interference. The key idea is to break bank sharing relationship among the simultaneously accessed data objects, instead of only focusing on the co-running threads. At sampling runs, we profile each thread to identify the simultaneously accessed objects. At actual runs, using the profiling information, the operating system partition banks at both the thread and object level. We have implemented OBP in Linux 2.6.32 kernel and evaluated its benefits on real machines. Experimental results show that OBP achieves an encouraging enhancement it terms of performance. ? 2014 ACM.EI
As the increasing of issue width has diminishing returns with superscalar processor, thread parallel...
Trying to attack the problem of resource contention, created by multiple parallel applications runni...
Most compilers focus on optimizing performance, often at the expense of memory, but efficient memory...
Applications running concurrently in CMP systems interfere with each other at DRAM memory, leading t...
Memory interference is a critical impediment to system performance in MPSoCs. To address this proble...
Main memory is a major shared resource among cores in a multicore system. If the interference betwee...
At the level of multi-core processors that share the same cache, data sharing among threads which be...
The last decade has seen the transition from single-core processors to multi-cores and many-cores. T...
Chip-level multiprocessors (CMP) have multiple processing cores (Cores) and generally have their cac...
Abstract—Memory profiling is the process of collecting memory address traces during the execution of...
Common implementations of core memory allocation components, like the Linux buddy system, handle con...
The concurrent memory reclamation problem is that of devising a way for a deallocating thread to ver...
Garbage collection can be a performance bottleneck in large distributed, multi-threaded applications...
We propose a novel kernel-level memory allocator, called M3 (Mcube, Multi-core Multi-bank Memory all...
The limitation imposed by instruction-level parallelism (ILP) has motivated the use of thread-level ...
As the increasing of issue width has diminishing returns with superscalar processor, thread parallel...
Trying to attack the problem of resource contention, created by multiple parallel applications runni...
Most compilers focus on optimizing performance, often at the expense of memory, but efficient memory...
Applications running concurrently in CMP systems interfere with each other at DRAM memory, leading t...
Memory interference is a critical impediment to system performance in MPSoCs. To address this proble...
Main memory is a major shared resource among cores in a multicore system. If the interference betwee...
At the level of multi-core processors that share the same cache, data sharing among threads which be...
The last decade has seen the transition from single-core processors to multi-cores and many-cores. T...
Chip-level multiprocessors (CMP) have multiple processing cores (Cores) and generally have their cac...
Abstract—Memory profiling is the process of collecting memory address traces during the execution of...
Common implementations of core memory allocation components, like the Linux buddy system, handle con...
The concurrent memory reclamation problem is that of devising a way for a deallocating thread to ver...
Garbage collection can be a performance bottleneck in large distributed, multi-threaded applications...
We propose a novel kernel-level memory allocator, called M3 (Mcube, Multi-core Multi-bank Memory all...
The limitation imposed by instruction-level parallelism (ILP) has motivated the use of thread-level ...
As the increasing of issue width has diminishing returns with superscalar processor, thread parallel...
Trying to attack the problem of resource contention, created by multiple parallel applications runni...
Most compilers focus on optimizing performance, often at the expense of memory, but efficient memory...