This thesis presents a cloud platform for automatic speech recognition, CloudASR, built on top of Kaldi speech recognition toolkit. The platform sup- ports both batch and online speech recognition mode and it has an annotation interface for transcription of the submitted recordings. The key features of the platform are scalability, customizability and easy deployment. Benchmarks of the platform show that the platform achieves comparable performance with Google Speech API in terms of latency and it can achieve better accuracy on limited domains. Furthermore, the benchmarks show that the platform is able to handle more than 1000 parallel requests given enough computational resources.
Those who are speech impaired (tunawicara in the Indonesian language) suffer from abnormalities in t...
Automatic speech recognition (ASR) technology has been developed to such a level that off-the-shelf ...
We describe a scalable architecture, particularly well-suited to cloud-based computing, which can be...
This paper presents the most recent developments of the webASR service (www.webasr.org), the world’...
The aim of this project is to simplify the deployment of an automatic speech recognition (ASR) appli...
This project aims to improve the scalability of the existing speech recognition system such that it ...
This work aims to propose a solution containing an automatic speech recognition system in cloud. Thu...
MAGOR is a web application that provides audio and video transcription services. The MAGOR system co...
Deep learning technology has encouraged research on noise-robust automatic speech recognition (ASR)....
This project aims to improve a speech-to-text web application that enables users to transcribe audio...
The CloudCAST platform provides a series of speech recognition services that can be integrated into ...
Automatic speech recognition (ASR) technology has been developed to such a level that off-the-shelf ...
We present ‘webASR’, an online interface to our state-of-the-art automatic speech recognition (ASR) ...
Distributed and parallel processing of big data has been applied in various applications for the pas...
Face recognition is one among the foremost wide used technologies, from a phone's lock screen to the...
Those who are speech impaired (tunawicara in the Indonesian language) suffer from abnormalities in t...
Automatic speech recognition (ASR) technology has been developed to such a level that off-the-shelf ...
We describe a scalable architecture, particularly well-suited to cloud-based computing, which can be...
This paper presents the most recent developments of the webASR service (www.webasr.org), the world’...
The aim of this project is to simplify the deployment of an automatic speech recognition (ASR) appli...
This project aims to improve the scalability of the existing speech recognition system such that it ...
This work aims to propose a solution containing an automatic speech recognition system in cloud. Thu...
MAGOR is a web application that provides audio and video transcription services. The MAGOR system co...
Deep learning technology has encouraged research on noise-robust automatic speech recognition (ASR)....
This project aims to improve a speech-to-text web application that enables users to transcribe audio...
The CloudCAST platform provides a series of speech recognition services that can be integrated into ...
Automatic speech recognition (ASR) technology has been developed to such a level that off-the-shelf ...
We present ‘webASR’, an online interface to our state-of-the-art automatic speech recognition (ASR) ...
Distributed and parallel processing of big data has been applied in various applications for the pas...
Face recognition is one among the foremost wide used technologies, from a phone's lock screen to the...
Those who are speech impaired (tunawicara in the Indonesian language) suffer from abnormalities in t...
Automatic speech recognition (ASR) technology has been developed to such a level that off-the-shelf ...
We describe a scalable architecture, particularly well-suited to cloud-based computing, which can be...