In this paper, we present a methodology for linguistic feature extraction that focuses on automatically syllabifying words in multiple languages and is designed to be compatible with a forced-alignment tool, the Montreal Forced Aligner (MFA). Our method extracts phonetic transcriptions from text, stress marks, and a unified automatic syllabification across the textual and phonetic domains. The system was built with open-source components and resources. Through an ablation study, we demonstrate the efficacy of our approach in automatically syllabifying words from several languages (English, French, and Spanish). Additionally, we apply the technique to the transcriptions of the CMU ...
Automatic phonemic transcription tools now reach high levels of accuracy on a ...
Over the past few years, self-supervised learned speech representations have emerged as fruitful rep...
This paper presents trainable methods for generating letter to sound rules from a given lexicon for ...
This paper presents a state-of-the-art model for transcribing speech in any language into the Intern...
In this paper, we introduce a massively multilingual speech corpus with fine-grained phonemic trans...
In this paper we present a statistical approach for the automatic syllabification of phonetic word t...
We present Maestro, a self-supervised training method to unify representations learnt from speech an...
Forced alignment, a speech recognition software performing semi-automatic phonological transcription...
Rapid deployment of automatic speech recognition (ASR) in new languages, with very limited data, is ...
The first step of most acoustic analyses unavoidably involves the alignment of recorded speech soun...
SPPAS, SPeech Phonetization Alignment and Syllabification, is a tool to automa...
We explore different ways of "spelling" a word in a speech recognizer's lexicon and h...
Thesis (M.Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and...
Forced alignment automatically aligns audio recordings of spoken language with transcripts at the se...