International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ extracted from radio broadcast corpora were used to recognize 45 speakers with a deep convolutional neural network (CNN). The same network was also trained with 62 phonetic parameters to i) see if the resulting confusions were identical to those made by the CNN trained with spectrograms, and ii) understand which acoustic parameters were used by the network. The two networks had identical discrimination results 68% of the time. In 22% of the data, the network trained with spectrograms achieved successful discrimination while the network trained with phonetic parameters failed, and the reverse was found in 10% of the data. We display the releva...
Recent advances in real-time magnetic resonance imaging (rtMRI) of the vocal tract provides opportun...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
Is it possible to train a neural network to accurately detect the features of a human voice such as ...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). Th...
This paper provides a comprehensive analysis of the effect of speaking rate on frame classification ...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Today's state-of-art in speech recognition involves deep neu-ral networks (DNN). These last years, a...
The field of artificial intelligence (AI) has long found that it is the things that humans find very...
Is it possible to train a neural network to accurately detect the features of a human voice such as ...
Recent advances in real-time magnetic resonance imaging (rtMRI) of the vocal tract provides opportun...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
Is it possible to train a neural network to accurately detect the features of a human voice such as ...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). Th...
This paper provides a comprehensive analysis of the effect of speaking rate on frame classification ...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Today's state-of-art in speech recognition involves deep neu-ral networks (DNN). These last years, a...
The field of artificial intelligence (AI) has long found that it is the things that humans find very...
Is it possible to train a neural network to accurately detect the features of a human voice such as ...
Recent advances in real-time magnetic resonance imaging (rtMRI) of the vocal tract provides opportun...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
Is it possible to train a neural network to accurately detect the features of a human voice such as ...