. Configurable Arithmetic Logic Units (ALUs) offer opportunities for adapting the underlying hardware to the computation for efficiency. The problem of identifying the optimal configurations at different steps in a program is a very complex issue but allows the power of these ALUs to be maximally used if solved. This paper focuses on developing an automatic compilation framework for exploiting operator parallelism within loop nests. The focus of this analysis is on identifying and maximally using configurations to avoid costly reconfiguration overheads. In our framework, initially some operator and loop transformations are carried out to expose more opportunities for configuration reuse. We then present a two pass solution. The first pass a...
Abstract In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
Current High-Level Synthesis (HLS) tools perform excellently for the synthesis of computation kernel...
. Reconfigurable circuits and systems have evolved from application specific accelerators to a gener...
Reconfigurable circuits and systems have evolved from application specific accelerators to a general...
Special issue on Microgrids. %HEVEA\publinkGVBCPST06.ps.gzInternational audienceModern compilers are...
Over the past 20 years, increases in processor speed have dramatically outstripped performance incre...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/18...
Compiling for parallelism is a longstanding topic of compiler research. This book describes the fund...
Dynamic hardware generation reduces the number of FPGA resources needed and speeds up an application...
This work leverages an original dependency analysis to parallelize loops regardless of their form i...
Abstract. State-of-the-art behavioral synthesis tools for reconfigurable architectures barely have h...
International audienceThis paper presents the method developed in the architecture compiler SCOOP: h...
Runtime reconfiguration provides an efficient means to reduce the hardware cost, while satisfying th...
The constant evolution of processors architectures, with superscalar, instruction-level parallelism,...
Abstract In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
Current High-Level Synthesis (HLS) tools perform excellently for the synthesis of computation kernel...
. Reconfigurable circuits and systems have evolved from application specific accelerators to a gener...
Reconfigurable circuits and systems have evolved from application specific accelerators to a general...
Special issue on Microgrids. %HEVEA\publinkGVBCPST06.ps.gzInternational audienceModern compilers are...
Over the past 20 years, increases in processor speed have dramatically outstripped performance incre...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/18...
Compiling for parallelism is a longstanding topic of compiler research. This book describes the fund...
Dynamic hardware generation reduces the number of FPGA resources needed and speeds up an application...
This work leverages an original dependency analysis to parallelize loops regardless of their form i...
Abstract. State-of-the-art behavioral synthesis tools for reconfigurable architectures barely have h...
International audienceThis paper presents the method developed in the architecture compiler SCOOP: h...
Runtime reconfiguration provides an efficient means to reduce the hardware cost, while satisfying th...
The constant evolution of processors architectures, with superscalar, instruction-level parallelism,...
Abstract In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
Over the past decade, microprocessor design strategies have focused on increasing the computational ...
Current High-Level Synthesis (HLS) tools perform excellently for the synthesis of computation kernel...