International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ extracted from radio broadcast corpora were used to recognize 45 speakers with a deep convolutional neural network (CNN). The same network was also trained with 62 phonetic parameters to i) see if the resulting confusions were identical to those made by the CNN trained with spectrograms, and ii) understand which acoustic parameters were used by the network. The two networks had identical discrimination results 68% of the time. In 22% of the data, the network trained with spectrograms achieved successful discrimination while the network trained with phonetic parameters failed, and the reverse was found in 10% of the data. We display the releva...
Most of the applications in speech use mel-frequency spectral coefficients (MFSC) as features as the...
A set of recurrent artificial neural networks are used for speech recognition. By representing speec...
We have recently proposed a universal acoustic characterisa-tion to foreign accent recognition, in w...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). Th...
This paper provides a comprehensive analysis of the effect of speaking rate on frame classification ...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
The field of artificial intelligence (AI) has long found that it is the things that humans find very...
Is it possible to train a neural network to accurately detect the features of a human voice such as ...
Recent advances in real-time magnetic resonance imaging (rtMRI) of the vocal tract provides opportun...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
Many languages identification (LID) systems rely on language models that use machine learning (ML) a...
Speaker identification with deep learning commonly use time-frequency representation of the voice si...
Deep learning is a technique with artificial intelligent (AI) that simulate humans’ learning behavio...
Most of the applications in speech use mel-frequency spectral coefficients (MFSC) as features as the...
A set of recurrent artificial neural networks are used for speech recognition. By representing speec...
We have recently proposed a universal acoustic characterisa-tion to foreign accent recognition, in w...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). Th...
This paper provides a comprehensive analysis of the effect of speaking rate on frame classification ...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
The field of artificial intelligence (AI) has long found that it is the things that humans find very...
Is it possible to train a neural network to accurately detect the features of a human voice such as ...
Recent advances in real-time magnetic resonance imaging (rtMRI) of the vocal tract provides opportun...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
Many languages identification (LID) systems rely on language models that use machine learning (ML) a...
Speaker identification with deep learning commonly use time-frequency representation of the voice si...
Deep learning is a technique with artificial intelligent (AI) that simulate humans’ learning behavio...
Most of the applications in speech use mel-frequency spectral coefficients (MFSC) as features as the...
A set of recurrent artificial neural networks are used for speech recognition. By representing speec...
We have recently proposed a universal acoustic characterisa-tion to foreign accent recognition, in w...