Deep neural networks consume an excessive amount of hardware resources, making them difficult to deploy to real-time systems. Previous work in the field of network compression lack the explicit hardware feedback necessary to control the resource constraints imposed by such systems. Furthermore, when the system under discussion is safety-critical, additional constraints must be enforced to make sure that acceptable safety levels are achieved. In this work, we take a reinforcement learning approach with which we evaluate three different compression actions: filter pruning, channel pruning and Tucker decomposition. We found that channel pruning was the most consistent one as it satisfied the constraints specification on five of six test scenar...
Structure pruning is an effective method to compress and accelerate neural networks. While filter an...
Deep neural networks (DNNs) typically have many weights. When errors appear in their weights, which ...
Prior works applied singular value decomposition and dropout compression methods for fully-connected...
In recent years, the model compression technique is very effective for deep neural network compressi...
We consider real-time safety-critical systems that feature closed-loop interactions between the embe...
In recent years, deep neural networks have achieved remarkable results in various artificial intelli...
International audienceNeuromorphic architectures are one of the most promising architectures to sign...
Reinforcement learning (RL) is a broad family of algorithms for training autonomous agents to collec...
This dissertation explores compression techniques for neural networks to enable control of resource ...
Deep neural networks have demonstrated outstanding performance in various fields of machine learning...
Deep neural networks (DNNs) have achieved great success in the field of computer vision. The high re...
The rapidly growing parameter volume of deep neural networks (DNNs) hinders the artificial intellige...
Reinforcement learning (RL) is an effective approach to developing control policies by maximizing th...
In recent years, deep learning models have become popular in the real-time embedded application, but...
The success of overparameterized deep neural networks (DNNs) poses a great challenge to deploy compu...
Structure pruning is an effective method to compress and accelerate neural networks. While filter an...
Deep neural networks (DNNs) typically have many weights. When errors appear in their weights, which ...
Prior works applied singular value decomposition and dropout compression methods for fully-connected...
In recent years, the model compression technique is very effective for deep neural network compressi...
We consider real-time safety-critical systems that feature closed-loop interactions between the embe...
In recent years, deep neural networks have achieved remarkable results in various artificial intelli...
International audienceNeuromorphic architectures are one of the most promising architectures to sign...
Reinforcement learning (RL) is a broad family of algorithms for training autonomous agents to collec...
This dissertation explores compression techniques for neural networks to enable control of resource ...
Deep neural networks have demonstrated outstanding performance in various fields of machine learning...
Deep neural networks (DNNs) have achieved great success in the field of computer vision. The high re...
The rapidly growing parameter volume of deep neural networks (DNNs) hinders the artificial intellige...
Reinforcement learning (RL) is an effective approach to developing control policies by maximizing th...
In recent years, deep learning models have become popular in the real-time embedded application, but...
The success of overparameterized deep neural networks (DNNs) poses a great challenge to deploy compu...
Structure pruning is an effective method to compress and accelerate neural networks. While filter an...
Deep neural networks (DNNs) typically have many weights. When errors appear in their weights, which ...
Prior works applied singular value decomposition and dropout compression methods for fully-connected...