International audienceSimultaneous Multi-Threading (SMT) is a hardware model in which different threads share the same instruction fetching unit. This model is a compromise between high parallelism and low hardware cost. Minimal Multi-Threading (MMT) is a technique recently proposed to share instructions and execution between threads in a SMT machine. In this paper we propose new ways to explore redundancies in the MMT execution model. First, we propose and evaluate a new thread reconvergence heuristics that handles function calls better than previous approaches. Second, we demonstrate the existence of substantial regularity in inter-thread memory access patterns. We validate our results on the four data-parallel applications present in the...
International audienceThreads of Single-Program Multiple-Data (SPMD) applications often execute the ...
We introduce explicit multi-threading (XMT), a decentralized architecture that exploits fine-grained...
A thread executing on a simultaneous multithreading (SMT) processor that experience a long-latency l...
International audienceSimultaneous Multi-Threading (SMT) is a hardware model in which different thre...
International audienceSimultaneous Multi-Threading (SMT) is a hardware model in which different thre...
capable of executing instructions from multiple threads in the same cycle. SMT in fact was introduce...
Simultaneous Multithreading (SMT) is proposed to improve pipeline throughput by overlapping executio...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
Compiler optimizations are often driven by specific assumptions about the underlying architecture an...
We show that when multi-threaded benchmarks are executed on a Chip Multiprocessor (CMP), the threads...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
Compiler optimizations are often driven by specific assumptions about the underlying architecture an...
The quest for high-performance has led to multi- and many-core systems. To push the performance of a...
In this paper, we propose Runahead Threads (RaT) as a valuable solution for both reducing resource c...
In this paper, we propose Runahead Threads (RaT) as a valuable solution for both reducing resource c...
International audienceThreads of Single-Program Multiple-Data (SPMD) applications often execute the ...
We introduce explicit multi-threading (XMT), a decentralized architecture that exploits fine-grained...
A thread executing on a simultaneous multithreading (SMT) processor that experience a long-latency l...
International audienceSimultaneous Multi-Threading (SMT) is a hardware model in which different thre...
International audienceSimultaneous Multi-Threading (SMT) is a hardware model in which different thre...
capable of executing instructions from multiple threads in the same cycle. SMT in fact was introduce...
Simultaneous Multithreading (SMT) is proposed to improve pipeline throughput by overlapping executio...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
Compiler optimizations are often driven by specific assumptions about the underlying architecture an...
We show that when multi-threaded benchmarks are executed on a Chip Multiprocessor (CMP), the threads...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
Compiler optimizations are often driven by specific assumptions about the underlying architecture an...
The quest for high-performance has led to multi- and many-core systems. To push the performance of a...
In this paper, we propose Runahead Threads (RaT) as a valuable solution for both reducing resource c...
In this paper, we propose Runahead Threads (RaT) as a valuable solution for both reducing resource c...
International audienceThreads of Single-Program Multiple-Data (SPMD) applications often execute the ...
We introduce explicit multi-threading (XMT), a decentralized architecture that exploits fine-grained...
A thread executing on a simultaneous multithreading (SMT) processor that experience a long-latency l...