This thesis introduces the data-triggered threads (DTT) programming and execution model. Unlike threads in conventional parallel programming models, the DTT model initiates threads on changes to memory locations. This enables increased parallelism and the elimination of redundant, unnecessary computation. This thesis shows that 78% of all loads fetch redundant data, leading to a high incidence of redundant computation. By expressing computation through the DTT model, that computation is executed once when the data changes, and is skipped whenever the data does not change. The set of C SPEC benchmarks show performance speedup of up to 5.9X, and averaging 46% with architectural support. To improve the generality of the DTT model, this thesis ...
grantor: University of TorontoThread-Level Data Speculation (TLDS) aim to improve the perf...
Across the wide range of multiprocessor architectures, all seem to share one common problem: they ar...
Speculative multithreading $(SpMT)$ promises to be an effective mechanism for parallelizing non-nume...
The data-triggered threads (DTT) programming and execution model can increase parallelism and elimin...
This paper presents CDTT, a compiler framework that takes C/C++ code and automatically generates a b...
Data-centric computing becomes increasingly important because of the rapid growth of application dat...
Current computing systems are mostly focused on achieving performance, programmability, energy effic...
Research on programming distributed memory multiprocessors has resulted in a well-understood program...
Threads provide a useful programming model for asynchronous behavior because of their ability to enc...
It is possible to reduce the computation time of data parallel programs by dividing the computation ...
Starting from a Data-Flow execution model called “DF-Threads”, we defined a minimalistic API to enab...
This paper focuses on the use of distributed memory multithreaded environments in data parallel prog...
Even though chip multiprocessors have emerged as the predominant organization for future microproces...
(eng) This paper focuses on the use of distributed memory multithreaded environments in data paralle...
This thesis studies efficient runtime systems for parallelism management (multithreading) and memory...
grantor: University of TorontoThread-Level Data Speculation (TLDS) aim to improve the perf...
Across the wide range of multiprocessor architectures, all seem to share one common problem: they ar...
Speculative multithreading $(SpMT)$ promises to be an effective mechanism for parallelizing non-nume...
The data-triggered threads (DTT) programming and execution model can increase parallelism and elimin...
This paper presents CDTT, a compiler framework that takes C/C++ code and automatically generates a b...
Data-centric computing becomes increasingly important because of the rapid growth of application dat...
Current computing systems are mostly focused on achieving performance, programmability, energy effic...
Research on programming distributed memory multiprocessors has resulted in a well-understood program...
Threads provide a useful programming model for asynchronous behavior because of their ability to enc...
It is possible to reduce the computation time of data parallel programs by dividing the computation ...
Starting from a Data-Flow execution model called “DF-Threads”, we defined a minimalistic API to enab...
This paper focuses on the use of distributed memory multithreaded environments in data parallel prog...
Even though chip multiprocessors have emerged as the predominant organization for future microproces...
(eng) This paper focuses on the use of distributed memory multithreaded environments in data paralle...
This thesis studies efficient runtime systems for parallelism management (multithreading) and memory...
grantor: University of TorontoThread-Level Data Speculation (TLDS) aim to improve the perf...
Across the wide range of multiprocessor architectures, all seem to share one common problem: they ar...
Speculative multithreading $(SpMT)$ promises to be an effective mechanism for parallelizing non-nume...