The world's vocal languages are estimated to number approximately 6,000, yet only a handful of them are well-resourced, which limits typological investigations, i.e., language-comparison studies that aim to understand universal trends in language. Crowd-sourced data could contribute to building homogeneous multilingual corpora and thereby give researchers access to large amounts of data in rare or remote languages. Yet crowd-sourced data are usually recorded with non-professional equipment in noisy environments, which poses a challenge for anyone wishing to use them in phonetic research. In this paper, we show how crowd-sourced data can contribute to academic research by using audio files from Ling...
Data-driven research in phonetics and phonology relies massively on oral resou...
Oral corpora for linguistic inquiry are frequently built based on the content ...
Less-resourced languages are usually left out of phonetic studies based on lar...