Recently proposed techniques like mini-graphs, CCA-subgraphs, and static strands exploit application-specific compound or fused instructions to reduce execution time, energy consumption, and/or processor complexity. To achieve their full potential, these techniques rely on static tools to identify common instruction sequences that make good fusion candidates. As a result, they also rely on ISA extension facilities that can encode these chosen instruction groups in a way that supports efficient execution on fusion-enabled hardware as well as compatibility across different implementations, including fusion-agnostic implementations. This paper describes handle prefix outlining, the ISA extension scheme used by mini-graph processors. Handle pre...
Data-parallel applications, such as data analytics, machine learning, and scientific computing, are ...
The automatic generation of instruction set extensions to provide application-specific acceleration ...
The use of special instructions that execute complex oper-ation patterns is a common approach in app...
Recently proposed techniques like mini-graphs, CCA-subgraphs, and static strands exploit application...
For years, single-thread performance was the most dominant force driving processor development. In r...
Abstract As the complexity of digital hardware grows steadily, so does the demand of high level mode...
A mini-graph is a dataflow graph that has an arbitrary internal size and shape but the interface of ...
In application-specific processor design, a common approach to improve performance and efficiency is...
This paper explores hardware specialization of low power processors to improve performance and ener...
International audienceThe Complex Instruction Set Computer (CISC) paradigm has led to the introducti...
The automatic generation of instruction set extensions (ISEs) to provide application-specific accele...
The end of Dennard scaling leads to new research directions that try to cope with the utilization wa...
In this paper, we address the problem of generating an optimal instruction sequence S for a Directed...
The use of special instructions that execute complex operation patterns is a common approach in appl...
The information and communication technology (ICT) sector is consuming an increasing proportion of g...
Data-parallel applications, such as data analytics, machine learning, and scientific computing, are ...
The automatic generation of instruction set extensions to provide application-specific acceleration ...
The use of special instructions that execute complex oper-ation patterns is a common approach in app...
Recently proposed techniques like mini-graphs, CCA-subgraphs, and static strands exploit application...
For years, single-thread performance was the most dominant force driving processor development. In r...
Abstract As the complexity of digital hardware grows steadily, so does the demand of high level mode...
A mini-graph is a dataflow graph that has an arbitrary internal size and shape but the interface of ...
In application-specific processor design, a common approach to improve performance and efficiency is...
This paper explores hardware specialization of low power processors to improve performance and ener...
International audienceThe Complex Instruction Set Computer (CISC) paradigm has led to the introducti...
The automatic generation of instruction set extensions (ISEs) to provide application-specific accele...
The end of Dennard scaling leads to new research directions that try to cope with the utilization wa...
In this paper, we address the problem of generating an optimal instruction sequence S for a Directed...
The use of special instructions that execute complex operation patterns is a common approach in appl...
The information and communication technology (ICT) sector is consuming an increasing proportion of g...
Data-parallel applications, such as data analytics, machine learning, and scientific computing, are ...
The automatic generation of instruction set extensions to provide application-specific acceleration ...
The use of special instructions that execute complex oper-ation patterns is a common approach in app...