Over the past decades, the dominant approach towards building automatic speech recognition (ASR) systems has been a complex combination of separately optimized pre-processing, acoustic model and language model components. The recently proposed end-to-end models for ASR present a significant simplification over conventional ASR systems. End-to-end models transcribe input speech to output text with a single neural network that is optimized in a single training stage. While the single model and training stage are a welcome simplification of the ASR system, they are also mostly incompatible with past research that went into optimizing the separate components of conventional ASR systems. Furthermore, the monolithic neural network structure in en...
<p>For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonem...
Speech recognition has become common in many application domains. Incorporating acoustic-phonetic kn...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
Recent work on neural networks with probabilistic parameters has shown that parameter uncertainty im...
Training deep neural network based Automatic Speech Recognition (ASR) models often requires thousand...
Training domain-specific automatic speech recognition (ASR) systems requires a suitable amount of da...
It is well known that additive noise can cause a significant decrease in performance for an automati...
As a result of advancement in deep learning and neural network technology, end-to-end models have be...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The problem of finding criteria through which a model will be chosen to match problems and available...
Over these last few years, the use of Artificial Neural Networks (ANNs), now often referred to as de...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
We introduce a new paradigm for Robust Automatic Speech Recognition that directly incorporates infor...
The general subject of this work is to present mathematical methods encountered in auto-matic speech...
<p>For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonem...
Speech recognition has become common in many application domains. Incorporating acoustic-phonetic kn...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
Recent work on neural networks with probabilistic parameters has shown that parameter uncertainty im...
Training deep neural network based Automatic Speech Recognition (ASR) models often requires thousand...
Training domain-specific automatic speech recognition (ASR) systems requires a suitable amount of da...
It is well known that additive noise can cause a significant decrease in performance for an automati...
As a result of advancement in deep learning and neural network technology, end-to-end models have be...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The problem of finding criteria through which a model will be chosen to match problems and available...
Over these last few years, the use of Artificial Neural Networks (ANNs), now often referred to as de...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
We introduce a new paradigm for Robust Automatic Speech Recognition that directly incorporates infor...
The general subject of this work is to present mathematical methods encountered in auto-matic speech...
<p>For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonem...
Speech recognition has become common in many application domains. Incorporating acoustic-phonetic kn...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...