National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). These last years, a certain research effort has been invested in characterizing the feature representations learned by DNNs. In this paper, we focus on convolutional neu-ral networks (CNN) trained for phoneme recognition in French. We report clustering experiments performed on activation maps extracted from the different layers of a CNN comprised of two convolution and sub-sampling layers followed by three dense layers. Our goal was to get insights into phone separability and phonemic categories inferred by the network, and how they vary according to the successive layers. Two directions were explored with both linear and non-linear clustering ...
Many languages identification (LID) systems rely on language models that use machine learning (ML) a...
Recently, convolutional neural networks (CNNs) have been shown to outperform the standard fully conn...
In this project a model has been created that with an audio sequence as input can classify the langu...
Today's state-of-art in speech recognition involves deep neu-ral networks (DNN). These last years, a...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
A defining problem in spoken language identification (LID) is how to design effective representation...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
Many languages identification (LID) systems rely on language models that use machine learning (ML) a...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
Many languages identification (LID) systems rely on language models that use machine learning (ML) a...
Recently, convolutional neural networks (CNNs) have been shown to outperform the standard fully conn...
In this project a model has been created that with an audio sequence as input can classify the langu...
Today's state-of-art in speech recognition involves deep neu-ral networks (DNN). These last years, a...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
A defining problem in spoken language identification (LID) is how to design effective representation...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
Many languages identification (LID) systems rely on language models that use machine learning (ML) a...
International audienceThe popularity of Deep Neural Networks (DNNs) is growing significantly, and so...
Many languages identification (LID) systems rely on language models that use machine learning (ML) a...
Recently, convolutional neural networks (CNNs) have been shown to outperform the standard fully conn...
In this project a model has been created that with an audio sequence as input can classify the langu...