We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for transcription of spoken language data. Utterances with varying speaker demographics (native and non-native English, male and female) were posted on the MTurk marketplace together with standard transcription guidelines. Transcriptions were compared against transcriptions carefully prepared in-house through conventional (manual) means. We found that transcriptions from MTurk workers were generally quite accurate. Further, when transcripts for the same utterance produced by multiple workers were combined using the ROVER voting scheme, the accuracy of the combined transcript rivaled that observed for conventional transcription methods. We also f...
Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasing...
Transcribing interview data is a time-consuming task that most qualitative researchers dislike. Tran...
There is an enormous amount of recorded speech generated daily, and quickly transcribing and analyzi...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
We investigate whether Amazon's Mechanical Turk (MTurk) service can be used as a reliable method ...
This study investigates the use of Amazon Mechanical Turk for the transcription of non-native speech...
We present a large scale study of the languages spoken by bilingual workers on Mechanical Turk (MTur...
We present a large scale study of the languages spoken by bilingual workers on Mechanical Turk (MTur...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Amazon’s Mechanical Turk service makes linguistic experimentation quick, easy, and inexpensive. Howe...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
In this paper, I argue for the use of Amazon Mechanical Turk (AMT) in language research. AMT is an o...
This paper describes a framework for evaluation of spoken dialogue systems. Typically, evaluation of...
Researchers have increasingly turned to Amazon Mechanical Turk (AMT) to crowdsource speech data, pre...
ccb cs jhu edu Manual evaluation of translation quality is generally thought to be excessively time ...
Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasing...
Transcribing interview data is a time-consuming task that most qualitative researchers dislike. Tran...
There is an enormous amount of recorded speech generated daily, and quickly transcribing and analyzi...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
We investigate whether Amazon's Mechanical Turk (MTurk) service can be used as a reliable method ...
This study investigates the use of Amazon Mechanical Turk for the transcription of non-native speech...
We present a large scale study of the languages spoken by bilingual workers on Mechanical Turk (MTur...
We present a large scale study of the languages spoken by bilingual workers on Mechanical Turk (MTur...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Amazon’s Mechanical Turk service makes linguistic experimentation quick, easy, and inexpensive. Howe...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
In this paper, I argue for the use of Amazon Mechanical Turk (AMT) in language research. AMT is an o...
This paper describes a framework for evaluation of spoken dialogue systems. Typically, evaluation of...
Researchers have increasingly turned to Amazon Mechanical Turk (AMT) to crowdsource speech data, pre...
ccb cs jhu edu Manual evaluation of translation quality is generally thought to be excessively time ...
Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasing...
Transcribing interview data is a time-consuming task that most qualitative researchers dislike. Tran...
There is an enormous amount of recorded speech generated daily, and quickly transcribing and analyzi...