We propose a Domain-Specific Architecture for elementary function computation to improve throughput while reducing power consumption as a model for more general applications: support fine-grained parallelism by eliminating branches, and eliminate the duplication required by coprocessors by decomposing computation into instructions which fit existing pipelined execution models and standard register files. Our example instruction architecture (ISA) extension supports scalar and vector/SIMD implementations of table-based methods of calculating all common special functions, with the aim of improving throughput by (1) eliminating the need for tables in memory, (2) eliminating all branches for special cases, and (3) reducing the total number of i...
(eng) This paper presents a new scheme for the hardware evaluation of elementary functions, based on...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
Abstract — This paper presents a methodology for synthesizing customized vector ISAs for various app...
We propose a Domain-Specific Architecture for elementary function com-putation to improve throughput...
Over the past years, a considerable amount of effort has been devoted to the definition and implemen...
Abstract—Elementary functions are extensively used in com-puter graphics, signal and image processin...
In this article, we present an approach for improv-ing the performance of sequences of dependent ins...
(eng) Many general table-based methods for the evaluation in hardware of elementary functions have b...
The relentless push in technology scaling driven by Moore's Law has witnessed fantastic gains in the...
The end of Dennard scaling leads to new research directions that try to cope with the utilization wa...
Hardware specialization has received renewed interest recently as chips are hitting power limits. Ch...
Hardware specialization has received renewed interest recently as chips are hitting power limits. Ch...
Over the past years, a considerable amount of effort has been devoted to the defin-ition and impleme...
This paper presents a new scheme for the hardware evaluation of functions in fixed-point format, for...
The tables-and-additions methods for accurate computation of elementary functions are fast in comput...
(eng) This paper presents a new scheme for the hardware evaluation of elementary functions, based on...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
Abstract — This paper presents a methodology for synthesizing customized vector ISAs for various app...
We propose a Domain-Specific Architecture for elementary function com-putation to improve throughput...
Over the past years, a considerable amount of effort has been devoted to the definition and implemen...
Abstract—Elementary functions are extensively used in com-puter graphics, signal and image processin...
In this article, we present an approach for improv-ing the performance of sequences of dependent ins...
(eng) Many general table-based methods for the evaluation in hardware of elementary functions have b...
The relentless push in technology scaling driven by Moore's Law has witnessed fantastic gains in the...
The end of Dennard scaling leads to new research directions that try to cope with the utilization wa...
Hardware specialization has received renewed interest recently as chips are hitting power limits. Ch...
Hardware specialization has received renewed interest recently as chips are hitting power limits. Ch...
Over the past years, a considerable amount of effort has been devoted to the defin-ition and impleme...
This paper presents a new scheme for the hardware evaluation of functions in fixed-point format, for...
The tables-and-additions methods for accurate computation of elementary functions are fast in comput...
(eng) This paper presents a new scheme for the hardware evaluation of elementary functions, based on...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
Abstract — This paper presents a methodology for synthesizing customized vector ISAs for various app...