A system and method are disclosed to train speech transcription models via crowdsourcing. Users of a media sharing platform may view real-time transcriptions associated with media on the user devices and identify the transcriptions as correct or incorrect. Users may determine with high accuracy correct and incorrect parts of transcribed text, using a general context of a conversation that is being transcribed. The users may select or mark blocks of transcription text and assign the selected text as correct transcription or incorrect transcription on the input user device. The system may aggregate a large amount of marked transcriptions from multiple user devices and store the marked transcriptions. The stored marked transcriptions may be us...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
Speech interfaces provide a natural and accessible mechanism allowing users that cannot read to inte...
Transcribed speech is a critical resource for building statistical speech recognition systems. Recen...
High quality transcription data is crucial for training automatic speech recognition (ASR) systems. ...
Automatic speech transcription systems are developed for various languages, domains,and applications...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Managing a large-scale speech transcription task with a team of human transcribers requires effecti...
Effective operation of a voice-activated virtual assistant requires accurate speech recognition. Man...
This paper presents a method for reducing the effort of transcribing user utterances to develop lang...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Machine generated speech transcriptions are a feature of several products such as videoconferencing ...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
Speech interfaces provide a natural and accessible mechanism allowing users that cannot read to inte...
Transcribed speech is a critical resource for building statistical speech recognition systems. Recen...
High quality transcription data is crucial for training automatic speech recognition (ASR) systems. ...
Automatic speech transcription systems are developed for various languages, domains,and applications...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Managing a large-scale speech transcription task with a team of human transcribers requires effecti...
Effective operation of a voice-activated virtual assistant requires accurate speech recognition. Man...
This paper presents a method for reducing the effort of transcribing user utterances to develop lang...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
Machine generated speech transcriptions are a feature of several products such as videoconferencing ...
This paper investigates improving lightly supervised acous-tic model training for an archive of broa...
Speech interfaces provide a natural and accessible mechanism allowing users that cannot read to inte...
Transcribed speech is a critical resource for building statistical speech recognition systems. Recen...