In a multicore environment, inter-thread communication can provide valuable insights about application performance. Literature detecting inter-thread communication either employs hardware simulators or binary instrumentation. Those techniques bring both space and time overhead, which makes them impractical to use on real-life applications. Instead, we take a completely different approach that leverages hardware performance counters and debug registers to detect communication volume between threads. The information generated by our tool can be utilized in several places to guide optimizations, understand performance behavior, and compare architectural features. In this talk, I present the design details of our tool along with experim...
Funding: This work was generously supported by UK EPSRC Energise, grant number EP/V006290/1.This pap...
International audienceUnderstanding the performance of a multi-threaded application is difficult. Th...
The complexity of an efficient thread management steadily rises with the number of processor cores a...
In this paper I compare inter-thread communication methods: blocking queue and LMAX Disruptor withou...
Understanding the performance of a multi-threaded appli- cation is difficult. The threads interfere ...
This paper presents COMPROF and COMPLACE, a novel profiling tool and thread placement technique for ...
The transition to multi-core architectures can be attributed mainly to fundamental limitations in cl...
Abstract: Multicore is an integrated circuit chip that uses two or more computational engines (cores...
Modern processors provide a multitude of opportunities for instruction-level parallelism that most c...
High-level, directive-based solutions are becoming the programming models (PMs) of the multi/many-co...
International audienceAlthough processors become massively multicore and therefore new programming m...
While multicore processors improve overall chip throughput and hardware utilization, resource sharin...
Simultaneous multithreading (SMT) allows multiple hardware threads to execute concurrently on a proc...
The era of multi-core processors has begun. These multi- core processors represent a significant shi...
© 2016 ACM. Relentless technology scaling has made transistors more vulnerable to soft, or transient...
Funding: This work was generously supported by UK EPSRC Energise, grant number EP/V006290/1.This pap...
International audienceUnderstanding the performance of a multi-threaded application is difficult. Th...
The complexity of an efficient thread management steadily rises with the number of processor cores a...
In this paper I compare inter-thread communication methods: blocking queue and LMAX Disruptor withou...
Understanding the performance of a multi-threaded appli- cation is difficult. The threads interfere ...
This paper presents COMPROF and COMPLACE, a novel profiling tool and thread placement technique for ...
The transition to multi-core architectures can be attributed mainly to fundamental limitations in cl...
Abstract: Multicore is an integrated circuit chip that uses two or more computational engines (cores...
Modern processors provide a multitude of opportunities for instruction-level parallelism that most c...
High-level, directive-based solutions are becoming the programming models (PMs) of the multi/many-co...
International audienceAlthough processors become massively multicore and therefore new programming m...
While multicore processors improve overall chip throughput and hardware utilization, resource sharin...
Simultaneous multithreading (SMT) allows multiple hardware threads to execute concurrently on a proc...
The era of multi-core processors has begun. These multi- core processors represent a significant shi...
© 2016 ACM. Relentless technology scaling has made transistors more vulnerable to soft, or transient...
Funding: This work was generously supported by UK EPSRC Energise, grant number EP/V006290/1.This pap...
International audienceUnderstanding the performance of a multi-threaded application is difficult. Th...
The complexity of an efficient thread management steadily rises with the number of processor cores a...