This thesis focuses on the development of neural network acoustic models for large vocabulary continuous speech recognition (LVCSR) to satisfy the design goals of low latency and low computational complexity. Low latency enables online speech recognition; and low computational complexity helps reduce the computational cost both during training and inference. Long span sequential dependencies and sequential distortions in the input vector sequence are a major challenge in acoustic modeling. Recurrent neural networks have been shown to effectively model these dependencies. Specifically, bidirectional long short term memory (BLSTM) networks, provide state-of-the-art performance across several LVCSR tasks. However the deployment of bidirection...
Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recog...
Real-time speech recognition on mobile and embedded devices is an important application of neural ne...
International audienceThis paper investigates speaker adaptation techniques for bidirectional long ...
: In this paper, we review the research work that deal with neural network based speech recognition ...
Long short-term memory (LSTM) has been effectively used to represent sequential data in recent years...
Long short-term memory (LSTM) has been effectively used to represent sequential data in recent years...
Long short-term memory (LSTM) has been effectively used to represent sequential data in recent years...
Deep neural networks (DNNs) have been playing a significant role in acoustic modeling. Convolutional...
Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. However, the ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. However, the ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
This paper describes a Deep Belief Neural Network (DBNN) and Bidirectional Long-Short Term Memory (L...
Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designe...
Abstract Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. Howe...
Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recog...
Real-time speech recognition on mobile and embedded devices is an important application of neural ne...
International audienceThis paper investigates speaker adaptation techniques for bidirectional long ...
: In this paper, we review the research work that deal with neural network based speech recognition ...
Long short-term memory (LSTM) has been effectively used to represent sequential data in recent years...
Long short-term memory (LSTM) has been effectively used to represent sequential data in recent years...
Long short-term memory (LSTM) has been effectively used to represent sequential data in recent years...
Deep neural networks (DNNs) have been playing a significant role in acoustic modeling. Convolutional...
Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. However, the ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. However, the ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
This paper describes a Deep Belief Neural Network (DBNN) and Bidirectional Long-Short Term Memory (L...
Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designe...
Abstract Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. Howe...
Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recog...
Real-time speech recognition on mobile and embedded devices is an important application of neural ne...
International audienceThis paper investigates speaker adaptation techniques for bidirectional long ...