The data-triggered threads (DTT) programming and execution model can increase parallelism and eliminate redundant compu-tation. However, the initial proposal requires signicant architec-ture support, which impedes existing applications and architectures from taking advantage of this model. This work proposes a pure software solution that supports the DTT model without any hardware support. This research uses a prototype compiler and runtime libraries running on top of existing machines. Several enhancements to the initial software implemen-tation are presented, which further improve the performance. The software runtime system improves the performance of se-rial C SPEC benchmarks by 15 % on a Nehalem processor, but by over 7X over the full ...
Even though chip multiprocessors have emerged as the predominant organization for future microproces...
The two current approaches to increasing computer speed are giving individual processors the ability...
Until recently, a steadily rising clock rate and other uniprocessor microarchitectural improvements ...
This thesis introduces the data-triggered threads (DTT) programming and execution model. Unlike thre...
This paper presents CDTT, a compiler framework that takes C/C++ code and automatically generates a b...
Data-centric computing becomes increasingly important because of the rapid growth of application dat...
Current computing systems are mostly focused on achieving performance, programmability, energy effic...
Threads provide a useful programming model for asynchronous behavior because of their ability to enc...
Research on programming distributed memory multiprocessors has resulted in a well-understood program...
Threads are units of concurrent execution that can be viewed as abstract data types (ADTs) with oper...
We believe that future many-core architectures should support a simple and scalable way to execute m...
Multi-core processors are ubiquitous in all market segments from embedded to high performance comput...
We describe the design and implementation of the Distributed Threads System (DTS), a programming env...
The Threaded Abstract Machine (TAM) refines dataflow execution models to address the critical constr...
Starting from a Data-Flow execution model called “DF-Threads”, we defined a minimalistic API to enab...
Even though chip multiprocessors have emerged as the predominant organization for future microproces...
The two current approaches to increasing computer speed are giving individual processors the ability...
Until recently, a steadily rising clock rate and other uniprocessor microarchitectural improvements ...
This thesis introduces the data-triggered threads (DTT) programming and execution model. Unlike thre...
This paper presents CDTT, a compiler framework that takes C/C++ code and automatically generates a b...
Data-centric computing becomes increasingly important because of the rapid growth of application dat...
Current computing systems are mostly focused on achieving performance, programmability, energy effic...
Threads provide a useful programming model for asynchronous behavior because of their ability to enc...
Research on programming distributed memory multiprocessors has resulted in a well-understood program...
Threads are units of concurrent execution that can be viewed as abstract data types (ADTs) with oper...
We believe that future many-core architectures should support a simple and scalable way to execute m...
Multi-core processors are ubiquitous in all market segments from embedded to high performance comput...
We describe the design and implementation of the Distributed Threads System (DTS), a programming env...
The Threaded Abstract Machine (TAM) refines dataflow execution models to address the critical constr...
Starting from a Data-Flow execution model called “DF-Threads”, we defined a minimalistic API to enab...
Even though chip multiprocessors have emerged as the predominant organization for future microproces...
The two current approaches to increasing computer speed are giving individual processors the ability...
Until recently, a steadily rising clock rate and other uniprocessor microarchitectural improvements ...