With the increasing popularity of deep learning approaches in the field of speech recognition and classification many of such problems are encountering a paradigm shift from classic approaches, such as hidden Markov models, to recurrent neural networks (RNN). In this paper we are going to examine that transition for the ALC corpus which had been used in the Interspeech 2011 Speaker State Challenge. Filter bank (FBANK) features are used alongside two types of bidirectional RNNs, each using gated recurrent units (GRU). Those models are used to classify the intoxication state of people just by recordings of their voices and outperform humans with state-of-the-art results
Speech Recognition is correctly transcribing the spoken utterances by the machine. A new area that i...
Speaker identification systems perform almost ideally in neutral talking environments. However, thes...
One possible explanation for RNN language models' outsized effectiveness in voice recognition is its...
Driving under the influence is one of the largest risk factors leading to accidents. Intoxication ma...
Speaker state recognition is a challenging problem due to speaker and context variability. Intoxicat...
This paper focuses on the automatic detection of a person’s blood level alcohol based on automatic s...
Recurrent neural network language models (RNNLMs) are powerful language modeling techniques. Signifi...
This paper analyzes the human performance of recognizing drunk speakers merely by voice and compares...
The ALC sub-challenge of the Interspeech Speaker State Chal-lenge (ISSC) aims at the automatic class...
End-to-end speech recognition is the problem of mapping raw audio signal all the way to text. In doi...
International audienceArchitectures of Recurrent Neural Networks (RNN) recently become a very popula...
In this paper, we investigate multiple approaches for automatically detecting intoxicated speakers g...
Speech recognition is far from an easy task, because of the unique nature of the human emotional tex...
Speaker identification systems perform almost ideally in neutral talking environments. However, thes...
The fact that an increasing number of functions in the automobile are and will be controlled by spee...
Speech Recognition is correctly transcribing the spoken utterances by the machine. A new area that i...
Speaker identification systems perform almost ideally in neutral talking environments. However, thes...
One possible explanation for RNN language models' outsized effectiveness in voice recognition is its...
Driving under the influence is one of the largest risk factors leading to accidents. Intoxication ma...
Speaker state recognition is a challenging problem due to speaker and context variability. Intoxicat...
This paper focuses on the automatic detection of a person’s blood level alcohol based on automatic s...
Recurrent neural network language models (RNNLMs) are powerful language modeling techniques. Signifi...
This paper analyzes the human performance of recognizing drunk speakers merely by voice and compares...
The ALC sub-challenge of the Interspeech Speaker State Chal-lenge (ISSC) aims at the automatic class...
End-to-end speech recognition is the problem of mapping raw audio signal all the way to text. In doi...
International audienceArchitectures of Recurrent Neural Networks (RNN) recently become a very popula...
In this paper, we investigate multiple approaches for automatically detecting intoxicated speakers g...
Speech recognition is far from an easy task, because of the unique nature of the human emotional tex...
Speaker identification systems perform almost ideally in neutral talking environments. However, thes...
The fact that an increasing number of functions in the automobile are and will be controlled by spee...
Speech Recognition is correctly transcribing the spoken utterances by the machine. A new area that i...
Speaker identification systems perform almost ideally in neutral talking environments. However, thes...
One possible explanation for RNN language models' outsized effectiveness in voice recognition is its...