This paper identifies a new opportunity for improving the efficiency of a processor core: memory access phases of pro-grams. These are dynamic regions of programs where most of the instructions are devoted to memory access or address computation. These occur naturally in programs because of workload properties, or when employing an in-core acceler-ator, we get induced phases where the code execution on the core is access code. We observe such code requires an OOO core’s dataflow and dynamism to run fast and does not execute well on an in-order processor. However, an OOO core con-sumes much power, effectively increasing energy consumption and reducing the energy efficiency of in-core accelerators. We develop an execution model called memory ...
Driven by the motivation to expose instruction-level parallelism (ILP), microprocessor cores have ev...
Summarization: Mapping computational intensive applications on reconfigurable technology for acceler...
Abstract The combination of growing transistor counts and limited power budget within a silicon die ...
This paper presents an evaluation of our Scheduled Dataflow (SDF) Processor. Recent focus in the fie...
Modern parallel programming models perform their best under the particular patterns they are tuned t...
This thesis proposes novel, efficient execution-paradigms for parallel heterogeneous architectures. ...
Abstract—The path towards future high performance comput-ers requires architectures able to efficien...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
Decoupled Access-Execute(DAE) is an innovative approach to optimize energy consumption of computer p...
The path towards future high performance computers requires architectures able to efficiently run mu...
Hardware acceleration is a widely accepted solution for performance and energy efficient computation...
There has been a resurgence of interest in dataflow architectures, because of their potential for ex...
It is now widely recognized that increased levels of parallelism are a necessary condition for impro...
The term "dataflow" generally encompasses three distinct aspects of computation - a data-driven mode...
Driven by the motivation to expose instruction-level parallelism (ILP), microprocessor cores have ev...
Summarization: Mapping computational intensive applications on reconfigurable technology for acceler...
Abstract The combination of growing transistor counts and limited power budget within a silicon die ...
This paper presents an evaluation of our Scheduled Dataflow (SDF) Processor. Recent focus in the fie...
Modern parallel programming models perform their best under the particular patterns they are tuned t...
This thesis proposes novel, efficient execution-paradigms for parallel heterogeneous architectures. ...
Abstract—The path towards future high performance comput-ers requires architectures able to efficien...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
Decoupled Access-Execute(DAE) is an innovative approach to optimize energy consumption of computer p...
The path towards future high performance computers requires architectures able to efficiently run mu...
Hardware acceleration is a widely accepted solution for performance and energy efficient computation...
There has been a resurgence of interest in dataflow architectures, because of their potential for ex...
It is now widely recognized that increased levels of parallelism are a necessary condition for impro...
The term "dataflow" generally encompasses three distinct aspects of computation - a data-driven mode...
Driven by the motivation to expose instruction-level parallelism (ILP), microprocessor cores have ev...
Summarization: Mapping computational intensive applications on reconfigurable technology for acceler...
Abstract The combination of growing transistor counts and limited power budget within a silicon die ...