AbstractScarcity of resources in under resourced languages may leave these languages behind in race of development of data driven NLP systems. Crowdsourcing has come up as a technique to bridge this gap, as it offers approach for collecting such resources in collaborative manner. Though some of Indian languages are widely spoken throughout the world yet many of them are resource poor when it is measured in terms of availability of transcribed and annotated resources for building reliable data driven systems. This paper describes an experience of speech data collection for Hindi through mobile using this approach for building automatic speech recognition and other speech based retrieval systems. This approach covers a lot of variety in terms...
Abstract---Automatic Speech Recognition System (ASR) is helpful for interaction between human and ma...
We present a method to expand the number of languages covered by simple speech recognizers. Enabling...
Machine learning has revolutionised speech technologies for major world languages, but these technol...
We describe the integration of several tools to enable the end-to-end development of an Automatic Sp...
Automatic Speech Recognition (ASR) researchers are turning their attention towards supporting low-re...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
Transcribed speech is a critical resource for building statistical speech recognition systems. Recen...
Linguistic code switching (LCS) occurs when speakers mix multiple languages in the same speech utter...
Mismatched crowdsourcing was recently proposed as a poten-tial approach to deriving moderately accur...
Recent work has established the efficacy of Amazon’s Mechanical Turk for constructing parallel corpo...
A Voice Oriented Interactive Computing Environment (VOICE) has been implemented in the Hindi languag...
Low resource languages possess a limited number of digitized texts, making it challenging togenerate...
Recent methods in speech and language technology pretrain very large models which are fine-tuned for...
Maximum digital information is available to fewer people who can read or understand a particular lan...
Speech is a natural means of communication between humans. Human being tried to develop computer tha...
Abstract---Automatic Speech Recognition System (ASR) is helpful for interaction between human and ma...
We present a method to expand the number of languages covered by simple speech recognizers. Enabling...
Machine learning has revolutionised speech technologies for major world languages, but these technol...
We describe the integration of several tools to enable the end-to-end development of an Automatic Sp...
Automatic Speech Recognition (ASR) researchers are turning their attention towards supporting low-re...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
Transcribed speech is a critical resource for building statistical speech recognition systems. Recen...
Linguistic code switching (LCS) occurs when speakers mix multiple languages in the same speech utter...
Mismatched crowdsourcing was recently proposed as a poten-tial approach to deriving moderately accur...
Recent work has established the efficacy of Amazon’s Mechanical Turk for constructing parallel corpo...
A Voice Oriented Interactive Computing Environment (VOICE) has been implemented in the Hindi languag...
Low resource languages possess a limited number of digitized texts, making it challenging togenerate...
Recent methods in speech and language technology pretrain very large models which are fine-tuned for...
Maximum digital information is available to fewer people who can read or understand a particular lan...
Speech is a natural means of communication between humans. Human being tried to develop computer tha...
Abstract---Automatic Speech Recognition System (ASR) is helpful for interaction between human and ma...
We present a method to expand the number of languages covered by simple speech recognizers. Enabling...
Machine learning has revolutionised speech technologies for major world languages, but these technol...