We collect and release CrowdSpeech — the first publicly available large-scale dataset of crowdsourced audio transcriptions. e show its applicability on an under-resourced language by constructing VoxDIY — a counterpart of CrowdSpeech for the Russian language
The MediaEval Multimedia Benchmark leveraged community cooperation and crowdsourcing to develop a la...
The MediaEval Multimedia Benchmark leveraged community cooperation and crowdsourcing to develop a la...
following two functionalities: (1) users click on the pronunciation variants of 16 words and the app...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
Audio captioning is a novel field of multi-modal translation and it is the task of creating a textua...
A system and method are disclosed to train speech transcription models via crowdsourcing. Users of a...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Vocal languages across the world are estimated to be approximately 6000, yet only a handful of them ...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
International audienceOral corpora for linguistic inquiry are frequently built based on the content ...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Cochl Acoustic Scene Dataset, or CochlScene, is a new acoustic scene dataset whose recordings are fu...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
This paper presents a crowdsourcing-based self-improvement frame-work of vocal activity detection (V...
The MediaEval Multimedia Benchmark leveraged community cooperation and crowdsourcing to develop a la...
The MediaEval Multimedia Benchmark leveraged community cooperation and crowdsourcing to develop a la...
following two functionalities: (1) users click on the pronunciation variants of 16 words and the app...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
Audio captioning is a novel field of multi-modal translation and it is the task of creating a textua...
A system and method are disclosed to train speech transcription models via crowdsourcing. Users of a...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Vocal languages across the world are estimated to be approximately 6000, yet only a handful of them ...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
International audienceOral corpora for linguistic inquiry are frequently built based on the content ...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Cochl Acoustic Scene Dataset, or CochlScene, is a new acoustic scene dataset whose recordings are fu...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
This paper presents a crowdsourcing-based self-improvement frame-work of vocal activity detection (V...
The MediaEval Multimedia Benchmark leveraged community cooperation and crowdsourcing to develop a la...
The MediaEval Multimedia Benchmark leveraged community cooperation and crowdsourcing to develop a la...
following two functionalities: (1) users click on the pronunciation variants of 16 words and the app...