Abstract
In this work, tone modeling approaches are used to capture the tonal structure of Vietnamese, and tonal features are also used to build acoustic models. Results on large-vocabulary continuous speech recognition (LVCSR) using deep bottleneck features (DBNFs) and different types of pronunciation dictionary are also presented. The experiments are carried out on a dataset of speech from the Voice of Vietnam (VOV) channel. The results show that the system using tonal phonemes obtains a 19.25% relative improvement over the best non-tonal phoneme system. The DBNF systems are applicable to the tonal dictionary, and adding the tonal feature as an input feature of the network yields a relative improvement of around 18% in recognition performance.
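The abstract reports its gains as relative improvements. As a minimal sketch of how such a figure is commonly computed, assuming the metric is an error rate where lower is better: the function name `relative_improvement` and the error rates below are hypothetical and are not taken from the paper.

```python
def relative_improvement(baseline_error: float, new_error: float) -> float:
    """Return the relative error reduction, in percent, of new_error against baseline_error."""
    if baseline_error <= 0:
        raise ValueError("baseline_error must be positive")
    return 100.0 * (baseline_error - new_error) / baseline_error


# Hypothetical error rates (percent), chosen only for illustration.
baseline = 20.0  # assumed non-tonal system
tonal = 16.0     # assumed tonal system
print(round(relative_improvement(baseline, tonal), 2))
```

A relative improvement is measured against the baseline's own error, so it differs from the absolute difference between the two rates.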
The paper provides a description and a small investigation of the Vietnamese tonal system in differe...
This paper is a preliminary report on Vietnamese tones in Central Vietnam, the most conservative dia...
This paper presents our recent activities for automatic speech recognition for...
Conventional wisdom in automatic speech recognition asserts that pitch information is not helpful in...
This paper proposes a method to build a Vietnamese Large Vocabulary Continuous...
In this paper, the pre-training method based on denoising auto-encoder is investigated and proved to...
This paper provides an overall description of the Vietnamese speech recognition system developed by ...
This paper presents our study on context-independent tone recognition of Vie...
Automatic speech recognition for languages in Southeast Asia, including Chinese, Thai and Vietnamese...
This paper presents our first steps in fast acoustic modeling for a new target language. Both knowle...
One important issue in designing state-of-the-art LVCSR systems is the choice of acoustic units. Con...
Understanding and managing tonal characteristics of Vietnamese language is one of the most difficult...