Reduced-precision floating-point (FP) arithmetic is being widely adopted to reduce memory footprint and execution time on battery-powered Internet of Things (IoT) end-nodes. However, reduced precision computations must meet end-do-end precision constraints to be acceptable at the application level. This work introduces TransLib 1 1 https://github.com/ahmad-mirsalari/TransLib, an open-source kernel library based on transprecision computing principles, which provides knobs to exploit different FP data types (i.e., float, float16, and bfloat16), also considering the trade-off between homogeneous and mixed-precision solutions. We demonstrate the capabilities of the proposed library on PULP, a 32-bit microcontroller (MCU) coupled with a parallel...
Small, low-cost IoT devices rely on floating-point (FP) software emulation on 32-bit integer cores w...
This paper presents the design and the implementation of a fully combinatorial floating point unit (...
In the Internet-Of-Things (IoT) domain, microcontrollers (MCUs) are used to collect and process data...
In modern low-power embedded platforms, floating-point (FP) operations emerge as a major contributor...
Ultra-low power computing is a key enabler of deeply embedded platforms used in domains such as dist...
International audienceIn recent years, Coarse Grain Reconfigurable Architecture (CGRA) accelerators ...
The slowdown of Moore's law and the power wall necessitates a shift toward finely tunable precision ...
For many years, computing systems rely on guaranteed numerical precision of each step in complex com...
International audienceFull-precision Floating-Point Units (FPUs) can be a source of extensive hardwa...
In recent years approximate computing has been extensively explored as a paradigm to design hardware...
The crisis of Moore's law and new dominant Machine Learning workloads require a paradigm shift towar...
The datasets have been collected by benchmarking three algorithms for Transprecision Computing (Corr...
Small, low-cost IoT devices rely on floating-point (FP) software emulation on 32-bit integer cores w...
This paper presents the design and the implementation of a fully combinatorial floating point unit (...
In the Internet-Of-Things (IoT) domain, microcontrollers (MCUs) are used to collect and process data...
In modern low-power embedded platforms, floating-point (FP) operations emerge as a major contributor...
Ultra-low power computing is a key enabler of deeply embedded platforms used in domains such as dist...
International audienceIn recent years, Coarse Grain Reconfigurable Architecture (CGRA) accelerators ...
The slowdown of Moore's law and the power wall necessitates a shift toward finely tunable precision ...
For many years, computing systems rely on guaranteed numerical precision of each step in complex com...
International audienceFull-precision Floating-Point Units (FPUs) can be a source of extensive hardwa...
In recent years approximate computing has been extensively explored as a paradigm to design hardware...
The crisis of Moore's law and new dominant Machine Learning workloads require a paradigm shift towar...
The datasets have been collected by benchmarking three algorithms for Transprecision Computing (Corr...
Small, low-cost IoT devices rely on floating-point (FP) software emulation on 32-bit integer cores w...
This paper presents the design and the implementation of a fully combinatorial floating point unit (...
In the Internet-Of-Things (IoT) domain, microcontrollers (MCUs) are used to collect and process data...