The Formosa speech database (ForSDat) is a multilingual speech corpus collected at Chang Gung University and sponsored by the National Science Council of Taiwan. It is expected that a multilingual speech corpus will be collected, covering the three most frequently used languages in Taiwan: Taiwanese (Min-nan), Hakka, and Mandarin. This 3-year project has the goal of collecting a phonetically abundant speech corpus of more than 1,800 speakers and hundreds of hours of speech. Recently, the first version of this corpus containing speech of 600 speakers of Taiwanese and Mandarin was finished and is ready to be released. It contains about 49 hours of speech and 247,000 utterances
Research in Chinese speech synthesis and speech recognition began nearly two decades ago in Taiwan. ...
This paper describes a project that aims to create a Mandarin speech database for the automobile set...
Taiwanese Child Language Corpus (TAICORP) is a corpus based on spontaneous conversations between you...
[[abstract]]Here, we describe an efficient algorithm to select phonetically balanced scripts for col...
In Taiwan, most people speak Mandarin, Southern Min, or Hakka. Not only are the three Chinese dialec...
The NTU-MC is a multilingual corpus that taps on the availability of multilingual text available in ...
Information technologies have now matured to the point of enabling researchers to create a repositor...
Speech corpus is the basis for analyzing the characteristics of speech signals and developing speech...
The NTU-MC is a multilingual corpus that taps on the availability of multilingual text available in ...
It is well understood that the speech databases play a very important role for speech recognition. I...
[[abstract]]Corpora, in their different forms for different purposes, have been the bases for modern...
The NTU-MC is a multilingual corpus that taps on the availability of multilingual text available in ...
[[abstract]]Mandarin speech data Across Taiwan (MAT) is a project initiated by members of the Associ...
Parallel corpus is a valuable resource for cross-language information retrieval and data-driven natu...
In 2016-2017, a 10-person team worked for 3 months with the goal of creating a multi-use, balanced c...
Research in Chinese speech synthesis and speech recognition began nearly two decades ago in Taiwan. ...
This paper describes a project that aims to create a Mandarin speech database for the automobile set...
Taiwanese Child Language Corpus (TAICORP) is a corpus based on spontaneous conversations between you...
[[abstract]]Here, we describe an efficient algorithm to select phonetically balanced scripts for col...
In Taiwan, most people speak Mandarin, Southern Min, or Hakka. Not only are the three Chinese dialec...
The NTU-MC is a multilingual corpus that taps on the availability of multilingual text available in ...
Information technologies have now matured to the point of enabling researchers to create a repositor...
Speech corpus is the basis for analyzing the characteristics of speech signals and developing speech...
The NTU-MC is a multilingual corpus that taps on the availability of multilingual text available in ...
It is well understood that the speech databases play a very important role for speech recognition. I...
[[abstract]]Corpora, in their different forms for different purposes, have been the bases for modern...
The NTU-MC is a multilingual corpus that taps on the availability of multilingual text available in ...
[[abstract]]Mandarin speech data Across Taiwan (MAT) is a project initiated by members of the Associ...
Parallel corpus is a valuable resource for cross-language information retrieval and data-driven natu...
In 2016-2017, a 10-person team worked for 3 months with the goal of creating a multi-use, balanced c...
Research in Chinese speech synthesis and speech recognition began nearly two decades ago in Taiwan. ...
This paper describes a project that aims to create a Mandarin speech database for the automobile set...
Taiwanese Child Language Corpus (TAICORP) is a corpus based on spontaneous conversations between you...