The Power3 processor is a 64-bit implementation of the PowerPC ™ architecture and is the successor to the Power2 ™ processor for workstations and servers which require high performance floating point capability. The previous processors used Newton-Raphson algorithms for their implementations of divide and square root. The Power3 processor has a longer pipeline latency, which would substantially increase the latency for these instructions. Instead, new algorithms based on power series approximations were developed which provide significantly better performance than the Newton-Raphson algorithm for this processor. This paper describes the algorithms, and then shows how both the series based algorithms and the Newton-Raphson algorithms are aff...
AbstractThis paper describes the implementation of rational approximation algorithms for evaluation ...
Applications in computer graphics, digital signal processing, communication systems, robotics, astro...
Arithmetic Square Root is one of the most complex but nevertheless widely used operations in modern ...
The implementations of division and square root in the FPU's of current microprocessors are bas...
this paper is to clarify and evaluate the implementation tradeoffs at the FPU level, thus enabling d...
In applications as in future MIMO communication systems a massive computation of complex matrix oper...
UnrestrictedDue to the constant advances in VLSI technology and the prevalence of many applications ...
The calculation of square roots is a frequently used operation in control systems of power electroni...
This thesis presents a comparison between implementations of the inverse square root function, using...
This paper describes a study of a class of algorithms for the floating-point divide and square root ...
This work presents fixed point square root algorithms and their implementation. These algorithms are...
International audienceThis paper presents an optimized software implementation of the reciprocal squ...
Square root calculation is a widely used task in real-time control systems especially in those, whic...
Division is one of the basic arithmetic operations supported by every computer system. The operation...
Newton-Raphson and Goldschmidt algorithms can be sped up by using variable latency hardware architec...
AbstractThis paper describes the implementation of rational approximation algorithms for evaluation ...
Applications in computer graphics, digital signal processing, communication systems, robotics, astro...
Arithmetic Square Root is one of the most complex but nevertheless widely used operations in modern ...
The implementations of division and square root in the FPU's of current microprocessors are bas...
this paper is to clarify and evaluate the implementation tradeoffs at the FPU level, thus enabling d...
In applications as in future MIMO communication systems a massive computation of complex matrix oper...
UnrestrictedDue to the constant advances in VLSI technology and the prevalence of many applications ...
The calculation of square roots is a frequently used operation in control systems of power electroni...
This thesis presents a comparison between implementations of the inverse square root function, using...
This paper describes a study of a class of algorithms for the floating-point divide and square root ...
This work presents fixed point square root algorithms and their implementation. These algorithms are...
International audienceThis paper presents an optimized software implementation of the reciprocal squ...
Square root calculation is a widely used task in real-time control systems especially in those, whic...
Division is one of the basic arithmetic operations supported by every computer system. The operation...
Newton-Raphson and Goldschmidt algorithms can be sped up by using variable latency hardware architec...
AbstractThis paper describes the implementation of rational approximation algorithms for evaluation ...
Applications in computer graphics, digital signal processing, communication systems, robotics, astro...
Arithmetic Square Root is one of the most complex but nevertheless widely used operations in modern ...