Less-resourced languages are usually left out of phonetic studies based on large corpora. We contribute to the recent efforts to fill this gap by assessing how to use open-access, crowd-sourced audio data from Lingua Libre for phonetic research. Lingua Libre is a participative linguistic library developed by Wikimedia France in 2015. It contains more than 670k recordings in approximately 150 languages from nearly 740 speakers. As a proof of concept, we consider the Inventory Size Hypothesis, which predicts that, in a given system, variation in the realization of each vowel will be inversely related to the number of vowel categories. We investigate data from 10 languages with various numbers of vowel categories, i.e.,...
French is a language spoken by hundreds of millions of speakers in Europe, Africa, and Amer...
Current studies in phonological and phonetic variation make an ever increasing use of large oral cor...
Minority languages are underrepresented in linguistic research, and a possible reason for this is th...
Vocal languages across the world are estimated to be approximately 6000, yet only a handful of them ...
Data-driven research in phonetics and phonology relies massively on oral resou...
Oral corpora for linguistic inquiry are frequently built based on the content ...
This repository contains the data gathered within the research project “At the intersection between ...
This study aims at renewing traditional dialectological atlases to provide a m...
A major hurdle in data-driven research on typology is having sufficient data in many languages to dr...
following two functionalities: (1) users click on the pronunciation variants of 16 words and the app...
Formant values of oral vowels are automatically measured in a total of 50000 s...
The present research addresses the question whether an automated analysis of l...