Hardware acceleration is a widely accepted solution for performance and energy efficient computation because it removes unnecessary hardware for general computation while delivering exceptional performance via specialized control paths and execution units. The spectrum of accelerators available today ranges from coarse-grain off-load engines such as GPUs to fine-grain instruction set extensions such as SSE. This research explores the benefits and challenges of managing memory at the data-structure level and exposing those operations directly to the ISA. We call these instructions. Abstract Datatype Instructions (ADIs). This paper quantifies the performance and energy impact of ADIs on the instruction and data cache hierarchies. For instruct...
The influence of embedded systems is felt in many aspects of our daily lives; being particularly app...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
Driven by the motivation to expose instruction-level parallelism (ILP), microprocessor cores have ev...
The failure of Dennard scaling [Bohr, 2007] and the rapid growth of data produced and consumed daily...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
The TRIPS system employs a new instruction set architecture (ISA) called Explicit Data Graph Executi...
Exposing details of the processor datapath to the programmer is motivated by improvements in the ene...
This paper identifies a new opportunity for improving the efficiency of a processor core: memory acc...
On-chip caches have been playing an important role in achieving high performance processors. In part...
Using FPGA-based acceleration of high-performance computing (HPC) applications to reduce energy and ...
Increasing demand for power-efficient, high-performance computing has spurred a growing number and d...
Although multi-threading processors can increase the performance of embedded systems with a minimum ...
This paper introduces Way Stealing, a simple architectural modification to a cache-based processor t...
Way Stealing is a simple architectural modification to a cache-based processor that increases the da...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
The influence of embedded systems is felt in many aspects of our daily lives; being particularly app...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
Driven by the motivation to expose instruction-level parallelism (ILP), microprocessor cores have ev...
The failure of Dennard scaling [Bohr, 2007] and the rapid growth of data produced and consumed daily...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
The TRIPS system employs a new instruction set architecture (ISA) called Explicit Data Graph Executi...
Exposing details of the processor datapath to the programmer is motivated by improvements in the ene...
This paper identifies a new opportunity for improving the efficiency of a processor core: memory acc...
On-chip caches have been playing an important role in achieving high performance processors. In part...
Using FPGA-based acceleration of high-performance computing (HPC) applications to reduce energy and ...
Increasing demand for power-efficient, high-performance computing has spurred a growing number and d...
Although multi-threading processors can increase the performance of embedded systems with a minimum ...
This paper introduces Way Stealing, a simple architectural modification to a cache-based processor t...
Way Stealing is a simple architectural modification to a cache-based processor that increases the da...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
The influence of embedded systems is felt in many aspects of our daily lives; being particularly app...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
Driven by the motivation to expose instruction-level parallelism (ILP), microprocessor cores have ev...