This paper presents an improved transfer learning framework applied to robust personalised speech recognition models for speakers with dysarthria. As the baseline of transfer learning, a state-of-theart CNN-TDNN-F ASR acoustic model trained solely on source domain data is adapted onto the target domain via neural network weight adaptation with the limited available data from target dysarthric speakers. Results show that linear weights in neural layers play the most important role for an improved modelling of dysarthric speech evaluated using UASpeech corpus, achieving averaged 11.6% and 7.6% relative recognition improvement in comparison to the conventional speaker-dependent training and data combination, respectively. To further improve th...
The final publication is available at https://link.springer.com/chapter/10.1007%2F978-3-319-49169-1_...
Speech production errors characteristic of dysarthria are chiefly responsible for the low accuracy o...
Gemmeke J.F., Sehgal S., Cunningham S., Van hamme H., ''Dysarthric vocal interfaces with minimal tra...
This paper presents an improved transfer learning framework applied to robust personalised speech re...
This thesis explores deep learning techniques to improve Automatic Speech Recognition (ASR) for peop...
Dysarthria is a neurological speech disorder, which exhibits multi-fold disturbances in the speech p...
Millions of individuals have acquired or have been born with neuro-motor conditions that limit the c...
Millions of individuals have acquired or have been born with neuro-motor conditions that limit the c...
This work addresses the mismatch problem between the distribution of training data (source) and test...
This work addresses the mismatch problem between the distribution of training data (source) and test...
This work addresses the mismatch problem between the distribution of training data (source) and test...
There has been much recent interest in building continuous speech recognition systems for people wi...
In light of steady progress in machine learning, automatic speech recognition (ASR) is entering more...
Speech production errors characteristic of dysarthria are chiefly responsible for the low accuracy o...
The final publication is available at https://link.springer.com/chapter/10.1007%2F978-3-319-49169-1_...
The final publication is available at https://link.springer.com/chapter/10.1007%2F978-3-319-49169-1_...
Speech production errors characteristic of dysarthria are chiefly responsible for the low accuracy o...
Gemmeke J.F., Sehgal S., Cunningham S., Van hamme H., ''Dysarthric vocal interfaces with minimal tra...
This paper presents an improved transfer learning framework applied to robust personalised speech re...
This thesis explores deep learning techniques to improve Automatic Speech Recognition (ASR) for peop...
Dysarthria is a neurological speech disorder, which exhibits multi-fold disturbances in the speech p...
Millions of individuals have acquired or have been born with neuro-motor conditions that limit the c...
Millions of individuals have acquired or have been born with neuro-motor conditions that limit the c...
This work addresses the mismatch problem between the distribution of training data (source) and test...
This work addresses the mismatch problem between the distribution of training data (source) and test...
This work addresses the mismatch problem between the distribution of training data (source) and test...
There has been much recent interest in building continuous speech recognition systems for people wi...
In light of steady progress in machine learning, automatic speech recognition (ASR) is entering more...
Speech production errors characteristic of dysarthria are chiefly responsible for the low accuracy o...
The final publication is available at https://link.springer.com/chapter/10.1007%2F978-3-319-49169-1_...
The final publication is available at https://link.springer.com/chapter/10.1007%2F978-3-319-49169-1_...
Speech production errors characteristic of dysarthria are chiefly responsible for the low accuracy o...
Gemmeke J.F., Sehgal S., Cunningham S., Van hamme H., ''Dysarthric vocal interfaces with minimal tra...