In this work we propose a shared floating-point unit (FPU) architecture for ultra-low-power (ULP) systems-on-chip operating at near-threshold voltage (NTV). Because high-performance FPUs are large and complex while their utilization is relatively low, adding one FPU per core in a ULP multicore is costly and power hungry. In our approach, a few FPUs are shared among all the cores in the system, which increases FPU utilization and leads to a more energy-efficient design. As part of our approach, we propose two FPU allocation techniques: optimal and random. Experimental results demonstrate that compared to a traditional private-FPU approach, our technique in a multicore system with 8 processors and 2 shared FPUs ...
The slowdown of Moore's law and the power wall necessitate a shift toward finely tunable precision ...
Ultra-low power computing is a key enabler of deeply embedded platforms used in domains such as dist...
Next-generation FPGA circuits will allow the integration of dozens of hard and...
Currently, the most powerful supercomputers can provide tens of petaflops. Future many-core systems ...
In the Internet-Of-Things (IoT) domain, microcontrollers (MCUs) are used to collect and process data...
Energy-efficient computation is critical if we are going to continue to scale performance i...
With increased densities on chips and the growing popularity of multicore processors and ge...
Data-parallel problems demand ever growing floating-point (FP) operations per second under tight are...
The stringent power constraints of complex microcontroller-based devices (e.g. smart sensors for the...
Several studies and recent real-world designs have promoted sharing of underutilized resources betwe...
In modern low-power embedded platforms, floating-point (FP) operations emerge as a major contributor...
The OpenRISC processor core, featuring a flat pipeline and a low area footprint, has been inte...
Designing special-purpose processors and ASICs to execute computer programs requires a methodology t...
Modern embedded systems are in charge of an increasing number of tasks that extensively...