The possibility of dynamically modifying the computational load of neural models at inference time is crucial for on-device processing, where computational power is limited and time-varying. Established approaches for neural model compression exist, but they provide architecturally static models. In this paper, we investigate the use of early-exit architectures, that rely on intermediate exit branches, applied to large-vocabulary speech recognition. This allows for the development of dynamic models that adjust their computational cost to the available resources and recognition performance. Unlike previous works, besides using pre-trained backbones we also train the model from scratch with an early-exit architecture. Experiments on public da...
<p>For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs)...
Over the past decades, the dominant approach towards building automatic speech recognition (ASR) sys...
This paper summarizes part of SRI's effort to improve acoustic modeling in the context of the L...
Training deep neural network based Automatic Speech Recognition (ASR) models often requires thousand...
Many of today's state-of-the-art automatic speech recognition (ASR) systems are based on hybrid hidd...
Convolutional Neural Network-based (CNN) inference is a demanding computational task where a long se...
International audienceAutomatic speech recognition is complementary to language recognition. The lan...
Recent advances in Transformer-based large language models (LLMs) have led to significant performanc...
This paper addresses the challenges of training large neural network models under federated learning...
Real-time speech recognition on mobile and embedded devices is an important application of neural ne...
<p>For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonem...
Despite achieving state-of-the-art performance on many NLP tasks, the high energy cost and long infe...
Deep learning is nowadays considered state-of-the-art technology in many applications thanks to huge...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
End-to-end speech recognition is the problem of mapping raw audio signal all the way to text. In doi...
<p>For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs)...
Over the past decades, the dominant approach towards building automatic speech recognition (ASR) sys...
This paper summarizes part of SRI's effort to improve acoustic modeling in the context of the L...
Training deep neural network based Automatic Speech Recognition (ASR) models often requires thousand...
Many of today's state-of-the-art automatic speech recognition (ASR) systems are based on hybrid hidd...
Convolutional Neural Network-based (CNN) inference is a demanding computational task where a long se...
International audienceAutomatic speech recognition is complementary to language recognition. The lan...
Recent advances in Transformer-based large language models (LLMs) have led to significant performanc...
This paper addresses the challenges of training large neural network models under federated learning...
Real-time speech recognition on mobile and embedded devices is an important application of neural ne...
<p>For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonem...
Despite achieving state-of-the-art performance on many NLP tasks, the high energy cost and long infe...
Deep learning is nowadays considered state-of-the-art technology in many applications thanks to huge...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
End-to-end speech recognition is the problem of mapping raw audio signal all the way to text. In doi...
<p>For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs)...
Over the past decades, the dominant approach towards building automatic speech recognition (ASR) sys...
This paper summarizes part of SRI's effort to improve acoustic modeling in the context of the L...