Simultaneous multithreading architectures have been de-fined previously with fully shared execution resources. When one thread in such an architecture experiences a very long-latency operation, such as a load miss, the thread will eventually stall, potentially holding resources which other threads could be using to make forward progress. This paper shows that in many cases it is better to free the resources associated with a stalled thread rather than keep that thread ready to immediately begin execution upon return of the loaded data. Several possible architectures are examined, and some simple solutions are shown to be very effective, achieving speedups close to 6.0 in some cases, and averaging 15 % speedup with four threads and over 100%...
grantor: University of TorontoLarge communication latency is a key obstacle to achieving h...
Multithreading techniques used within computer processors aim to provide the computer system with ...
This work presents and evaluates a novel processor microarchitecture which combines two paradigms: a...
Multithreaded architectures use the parallelism in programs to tolerate long latencies for communica...
Simultaneous multithreading (SMT) processors fetch instructions from several threads, increasing the...
Multithreaded multiprocessor systems (MMS) have been proposed to tolerate long latencies for communi...
A thread executing on a simultaneous multithreading (SMT) processor that experiences a long-latency ...
As processor clock frequencies continue to improve at a rate that exceeds the rate of improvement in...
Abstract. Threads experiencing long-latency loads on a simultaneous multith-reading (SMT) processor ...
grantor: University of TorontoMemory latency is becoming an increasingly important perform...
Multithreading has been proposed as an architectural strategy for tolerating latency in multiprocess...
A thread executing on a simultaneous multithreading (SMT) processor that experience a long-latency l...
Threads experiencing long-latency loads on a simultaneous multithreading (SMT) processor may clog sh...
Existing multiprocessor synchronization mechanisms are relatively heavyweight, due in part to the le...
Since the era of vector and pipelined computing, the computational speed is limited by the memory ac...
grantor: University of TorontoLarge communication latency is a key obstacle to achieving h...
Multithreading techniques used within computer processors aim to provide the computer system with ...
This work presents and evaluates a novel processor microarchitecture which combines two paradigms: a...
Multithreaded architectures use the parallelism in programs to tolerate long latencies for communica...
Simultaneous multithreading (SMT) processors fetch instructions from several threads, increasing the...
Multithreaded multiprocessor systems (MMS) have been proposed to tolerate long latencies for communi...
A thread executing on a simultaneous multithreading (SMT) processor that experiences a long-latency ...
As processor clock frequencies continue to improve at a rate that exceeds the rate of improvement in...
Abstract. Threads experiencing long-latency loads on a simultaneous multith-reading (SMT) processor ...
grantor: University of TorontoMemory latency is becoming an increasingly important perform...
Multithreading has been proposed as an architectural strategy for tolerating latency in multiprocess...
A thread executing on a simultaneous multithreading (SMT) processor that experience a long-latency l...
Threads experiencing long-latency loads on a simultaneous multithreading (SMT) processor may clog sh...
Existing multiprocessor synchronization mechanisms are relatively heavyweight, due in part to the le...
Since the era of vector and pipelined computing, the computational speed is limited by the memory ac...
grantor: University of TorontoLarge communication latency is a key obstacle to achieving h...
Multithreading techniques used within computer processors aim to provide the computer system with ...
This work presents and evaluates a novel processor microarchitecture which combines two paradigms: a...