International audienceAutomatic speech recognition is complementary to language recognition. The language recognition systems exploit this complementarity by using frame-level bottleneck features extracted from neural networks trained with a phone recognition task. Recent methods apply frame-level bottleneck features extracted from an end-to-end sequence-to-sequence speech recognition model. In this work, we study an integrated approach of the training of the speech recognition feature extractor and language recognition modules. We show that for both classical phone recognition and end-to-end sequence-to-sequence features, sequential training of the two modules is not the optimal strategy. The feature extractor can be improved by supervisio...
In the phonotactic approach for language recognition, a phone tokeniser is normally used to transfo...
In this work, a novel training scheme for generating bottleneck features from deep neural networks i...
© 2016 IEEE. In recent years, research on language modeling for speech recognition has increasingly ...
Language recognition systems based on bottleneck features have recently become the state-of-the-art ...
The development of a speech recognition system requires at least three resources: a large labeled sp...
In this paper, we investigate phone sequence modeling with recurrent neural networks in the context ...
Recently, deep bottleneck features (DBF) extracted from a deep neural network (DNN) containing a nar...
This work investigates features derived from an artificial neural network. These artificial neural n...
In this work, a novel training scheme for generating bottleneck fea-tures from deep neural networks ...
This paper proposes a new phone lattice based method for automatic language recognition from speech ...
This paper presents a unified i-vector framework for language identification (LID) based on deep bot...
This thesis addresses the problem of speech phone recognition. Phones are the acoustic sounds of spe...
We study two key issues in task-independent training, namely selection of a universal set of subword...
The first part of this thesis focuses on very low-dimensional bottleneck features (BNFs), extracted ...
This paper presents the application of Neural Network Bot-tleneck (BN) features in Language Identifi...
In the phonotactic approach for language recognition, a phone tokeniser is normally used to transfo...
In this work, a novel training scheme for generating bottleneck features from deep neural networks i...
© 2016 IEEE. In recent years, research on language modeling for speech recognition has increasingly ...
Language recognition systems based on bottleneck features have recently become the state-of-the-art ...
The development of a speech recognition system requires at least three resources: a large labeled sp...
In this paper, we investigate phone sequence modeling with recurrent neural networks in the context ...
Recently, deep bottleneck features (DBF) extracted from a deep neural network (DNN) containing a nar...
This work investigates features derived from an artificial neural network. These artificial neural n...
In this work, a novel training scheme for generating bottleneck fea-tures from deep neural networks ...
This paper proposes a new phone lattice based method for automatic language recognition from speech ...
This paper presents a unified i-vector framework for language identification (LID) based on deep bot...
This thesis addresses the problem of speech phone recognition. Phones are the acoustic sounds of spe...
We study two key issues in task-independent training, namely selection of a universal set of subword...
The first part of this thesis focuses on very low-dimensional bottleneck features (BNFs), extracted ...
This paper presents the application of Neural Network Bot-tleneck (BN) features in Language Identifi...
In the phonotactic approach for language recognition, a phone tokeniser is normally used to transfo...
In this work, a novel training scheme for generating bottleneck features from deep neural networks i...
© 2016 IEEE. In recent years, research on language modeling for speech recognition has increasingly ...