The amounts of data that need to be transmitted, processed, and stored by the modern deep neural networks have reached truly enormous volumes in the last few years calling for the invention of new paradigms both in hardware and software development. One of the most promising and rapidly advancing frontiers here is the creation of new numerical formats. In this work we focus on the family of block floating point numerical formats due to their combination of wide dynamic range, numerical accuracy, and efficient hardware implementation of inner products using simple integer arithmetic. These formats are characterized by a block of mantissas with a shared scale factor. The basic Block Floating Point (BFP) format quantizes the block scales into ...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
The present work investigates the significance of arithmetic precision in neural network simulation....
International audienceGraphics Processing Units (GPUs) offer the possibility to execute floating-poi...
The heavy burdens of computation and off-chip traffic impede deploying the large scale convolution n...
A special case of floating point data representation is block floating point format where a block of...
The unprecedented growth in DNN model complexity, size and the amount of training data have led to a...
Narrow bit-width data formats are key to reducing the computational and storage costs of modern deep...
Analog mixed-signal (AMS) devices promise faster, more energy-efficient deep neural network (DNN) in...
Understanding the bit-width precision is critical in compact representation of a Deep Neural Network...
International audienceDeep Neural Networks (DNN) represent a performance-hungry application. Floatin...
Approximate computing has emerged as a promising approach to energy-efficient design of digital syst...
An important aspect of modern automation is machine learning. Specifically, neural networks are used...
Specialized hardware implementations of Artificial Neural Networks (ANNs) can offer faster execution...
In recent years, neural networks have grown in popularity, mostly thanks to the advances in the fiel...
International audienceThe numerical format used for representing weights and activations plays a key...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
The present work investigates the significance of arithmetic precision in neural network simulation....
International audienceGraphics Processing Units (GPUs) offer the possibility to execute floating-poi...
The heavy burdens of computation and off-chip traffic impede deploying the large scale convolution n...
A special case of floating point data representation is block floating point format where a block of...
The unprecedented growth in DNN model complexity, size and the amount of training data have led to a...
Narrow bit-width data formats are key to reducing the computational and storage costs of modern deep...
Analog mixed-signal (AMS) devices promise faster, more energy-efficient deep neural network (DNN) in...
Understanding the bit-width precision is critical in compact representation of a Deep Neural Network...
International audienceDeep Neural Networks (DNN) represent a performance-hungry application. Floatin...
Approximate computing has emerged as a promising approach to energy-efficient design of digital syst...
An important aspect of modern automation is machine learning. Specifically, neural networks are used...
Specialized hardware implementations of Artificial Neural Networks (ANNs) can offer faster execution...
In recent years, neural networks have grown in popularity, mostly thanks to the advances in the fiel...
International audienceThe numerical format used for representing weights and activations plays a key...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
The present work investigates the significance of arithmetic precision in neural network simulation....
International audienceGraphics Processing Units (GPUs) offer the possibility to execute floating-poi...