Deep bottleneck features (DBNFs) have been used successfully in the past for acoustic speech recognition from audio. However, research on extracting DBNFs for visual speech recognition is very limited. In this work, we present an approach to extract deep bottleneck visual features based on deep autoencoders. To the best of our knowledge, this is the first work that extracts DBNFs for visual speech recognition directly from pixels. We first train a deep autoencoder with a bottleneck layer in order to reduce the dimensionality of the image. Then the autoencoder's decoding layers are replaced by classification layers which make the bottleneck features more discriminative. Discrete Cosine Transform (DCT) features are also appended in the bottle...
Language recognition systems based on bottleneck features have recently become the state-of-the-art ...
In this work, a novel training scheme for generating bottleneck fea-tures from deep neural networks ...
Recently, deep bottleneck features (DBF) extracted from a deep neural network (DNN) containing a nar...
A key problem in spoken language identification (LID) is to design effective representations which a...
As a feed-forward architecture, the recently proposed maxout networks integrate dropout naturally an...
Our previous work has shown that Deep Bottleneck Features (DBF), generated from a well-trained Deep ...
Our previous work has shown that Deep Bottleneck Features (DBF), generated from a well-trained Deep ...
A key problem in spoken language identification (LID) is to design effective representations which a...
A key problem in spoken language identification (LID) is to design effective representations which a...
A key problem in spoken language identification (LID) is to design effective representations which a...
In this work, a novel training scheme for generating bottleneck features from deep neural networks i...
In this paper, the pre-training method based on denoising auto-encoder is investigated and proved to...
Traditional visual speech recognition systems consist of two stages, feature extraction and classifi...
Automatic speech recognition (ASR) permits effective interaction between humans and machines in envi...
Visual speech, referring to the visual domain of speech, has attracted increasing attention due to i...
Language recognition systems based on bottleneck features have recently become the state-of-the-art ...
In this work, a novel training scheme for generating bottleneck fea-tures from deep neural networks ...
Recently, deep bottleneck features (DBF) extracted from a deep neural network (DNN) containing a nar...
A key problem in spoken language identification (LID) is to design effective representations which a...
As a feed-forward architecture, the recently proposed maxout networks integrate dropout naturally an...
Our previous work has shown that Deep Bottleneck Features (DBF), generated from a well-trained Deep ...
Our previous work has shown that Deep Bottleneck Features (DBF), generated from a well-trained Deep ...
A key problem in spoken language identification (LID) is to design effective representations which a...
A key problem in spoken language identification (LID) is to design effective representations which a...
A key problem in spoken language identification (LID) is to design effective representations which a...
In this work, a novel training scheme for generating bottleneck features from deep neural networks i...
In this paper, the pre-training method based on denoising auto-encoder is investigated and proved to...
Traditional visual speech recognition systems consist of two stages, feature extraction and classifi...
Automatic speech recognition (ASR) permits effective interaction between humans and machines in envi...
Visual speech, referring to the visual domain of speech, has attracted increasing attention due to i...
Language recognition systems based on bottleneck features have recently become the state-of-the-art ...
In this work, a novel training scheme for generating bottleneck fea-tures from deep neural networks ...
Recently, deep bottleneck features (DBF) extracted from a deep neural network (DNN) containing a nar...