Hardware specialization is becoming a promising paradigm for future microprocessors. Unfortunately, by its very nature, the exploration of specialization ideas, (the artifact is dubbed an ?accelerator?) are developed, evaluated, and published as end-to-end vertical silos spanning application, language/compiler, and hardware architecture, with per-accelerator customized tools, and little opportunity for cross-application of ideas from one accelerator into another. This paper develops a novel program representation suitable for the hardware specialization paradigm, called the transformable dependence graph (TDG), which combines semantic information about program properties and low-level hardware events (cache misses, branch mis-predictions...
This paper identifies a new opportunity for improving the efficiency of a processor core: memory acc...
This report examines ultra-fine grain machine parallelism determined by various hardware styles and ...
Applications may have unintended performance problems in spite of compiler optimizations, because of...
Since the invention of the microprocessor in 1971, the computational capacity of the microprocessor ...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
In the last few decades, Von-Neumann super-scalar processors have been the superior approach for imp...
Hardware acceleration in the form of customized datapath and control circuitry tuned to specific app...
Tuning the performance of applications requires understanding the interactions between code and targ...
General-purpose processors are often incapable of achieving the challenging cost, performance, and p...
Modern reconfigurable computing systems feature pow-erful hybrid architectures with multiple micropr...
In recent years, the computing landscape has seen a shift towards specialized accelerators since the...
Hardware accelerators have become permanent features in the post-Dennard computing landscape, displa...
Domain-specific acceleration is now a “must” for all the computing spectrum, going from high perform...
Performance increase, in terms of faster execution and energy efficiency, is a never-ending research...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
This paper identifies a new opportunity for improving the efficiency of a processor core: memory acc...
This report examines ultra-fine grain machine parallelism determined by various hardware styles and ...
Applications may have unintended performance problems in spite of compiler optimizations, because of...
Since the invention of the microprocessor in 1971, the computational capacity of the microprocessor ...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
In the last few decades, Von-Neumann super-scalar processors have been the superior approach for imp...
Hardware acceleration in the form of customized datapath and control circuitry tuned to specific app...
Tuning the performance of applications requires understanding the interactions between code and targ...
General-purpose processors are often incapable of achieving the challenging cost, performance, and p...
Modern reconfigurable computing systems feature pow-erful hybrid architectures with multiple micropr...
In recent years, the computing landscape has seen a shift towards specialized accelerators since the...
Hardware accelerators have become permanent features in the post-Dennard computing landscape, displa...
Domain-specific acceleration is now a “must” for all the computing spectrum, going from high perform...
Performance increase, in terms of faster execution and energy efficiency, is a never-ending research...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
This paper identifies a new opportunity for improving the efficiency of a processor core: memory acc...
This report examines ultra-fine grain machine parallelism determined by various hardware styles and ...
Applications may have unintended performance problems in spite of compiler optimizations, because of...