International audienceA deep convolutional neural network was trained to classify 45 speakers based on spectrograms of their productions of the French vowel /ɑ̃/ Although the model achieved fairly high accuracy – over 85 % – our primary focus here was phonetic interpretability rather than sheer performance. In order to better understand what kind of representations were learned by the model, i) several versions of the model were trained and tested with low-pass filtered spectrograms with a varying cut-off frequency and ii) classification was also performed with masked frequency bands. The resulting decline in accuracy was utilized to spot relevant frequencies for speaker classification and voice comparison, and to produce phonetically inter...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). Th...
Senior Project submitted to The Division of Science, Mathematics and Computing of Bard College. Abst...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceApart from the impressive performance it has achieved in several tasks, one of...
The field of artificial intelligence (AI) has long found that it is the things that humans find very...
In this paper, we investigate the connection between how people understand speech and how speech is ...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Speaker identification with deep learning commonly use time-frequency representation of the voice si...
This paper provides a comprehensive analysis of the effect of speaking rate on frame classification ...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
International audienceWe present the Perceptimatic English Benchmark, an open experimental benchmark...
This paper discusses a transition from the traditional methods to novel deep learning architectures ...
Abstract — Speech Recognition is the translation of spoken words into text. Speech recognition invol...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). Th...
Senior Project submitted to The Division of Science, Mathematics and Computing of Bard College. Abst...
International audienceA deep convolutional neural network was trained to classify 45 speakers based ...
International audienceBroadband spectrograms of French vowels /Ã/, /a/, /E/, /e/, /i/, /@/, and /O/ ...
International audienceApart from the impressive performance it has achieved in several tasks, one of...
The field of artificial intelligence (AI) has long found that it is the things that humans find very...
In this paper, we investigate the connection between how people understand speech and how speech is ...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
Speaker identification with deep learning commonly use time-frequency representation of the voice si...
This paper provides a comprehensive analysis of the effect of speaking rate on frame classification ...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
International audienceWe present the Perceptimatic English Benchmark, an open experimental benchmark...
This paper discusses a transition from the traditional methods to novel deep learning architectures ...
Abstract — Speech Recognition is the translation of spoken words into text. Speech recognition invol...
Deep learning models have improved cutting-edge technologies in many research areas, but their black...
National audienceToday's state-of-art in speech recognition involves deep neu-ral networks (DNN). Th...
Senior Project submitted to The Division of Science, Mathematics and Computing of Bard College. Abst...