Mismatched crowdsourcing was recently proposed as a poten-tial approach to deriving moderately accurate speech transcrip-tions using crowd workers unfamiliar with the language be-ing spoken. In introducing this approach, we demonstrated its promise with the help of an isolated word recovery task for Hindi. However, it remained open whether mismatched crowd sourcing can yield non-trivial accuracy in a continuous speech task. In this work, we focus on this question and demonstrate a word error rate of under 45 % in a large-vocabulary task (again for Hindi). In achieving this, we develop several new tech-niques capable of scaling effectively to continuous speech. We also provide an information theoretic analysis and estimate the amount of info...
Human annotation is still an essential part of modern transcription workflows for digitizing music s...
This paper discusses several technical challenges in using crowdsourcing for distributed correction ...
High quality transcription data is crucial for training automatic speech recognition (ASR) systems. ...
Transcribed speech is a critical resource for building statistical speech recognition systems. Recen...
AbstractScarcity of resources in under resourced languages may leave these languages behind in race ...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
AbstractMismatched crowdsourcing is a technique for acquiring automatic speech recognizer training d...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
Transcribed speech is an essential resource to develop speech technologies for different languages ...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
In this paper, we investigate different ap-proaches in crowdsourcing transcriptions of Dialectal Ara...
Open Crowdsourcing platforms like Amazon Mechanical Turk provide an attractive solution for process ...
A system and method are disclosed to train speech transcription models via crowdsourcing. Users of a...
Human annotation is still an essential part of modern transcription workflows for digitizing music s...
Human annotation is still an essential part of modern transcription workflows for digitizing music s...
This paper discusses several technical challenges in using crowdsourcing for distributed correction ...
High quality transcription data is crucial for training automatic speech recognition (ASR) systems. ...
Transcribed speech is a critical resource for building statistical speech recognition systems. Recen...
AbstractScarcity of resources in under resourced languages may leave these languages behind in race ...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
AbstractMismatched crowdsourcing is a technique for acquiring automatic speech recognizer training d...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
Transcribed speech is an essential resource to develop speech technologies for different languages ...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
In this paper, we investigate different ap-proaches in crowdsourcing transcriptions of Dialectal Ara...
Open Crowdsourcing platforms like Amazon Mechanical Turk provide an attractive solution for process ...
A system and method are disclosed to train speech transcription models via crowdsourcing. Users of a...
Human annotation is still an essential part of modern transcription workflows for digitizing music s...
Human annotation is still an essential part of modern transcription workflows for digitizing music s...
This paper discusses several technical challenges in using crowdsourcing for distributed correction ...
High quality transcription data is crucial for training automatic speech recognition (ASR) systems. ...