Recognition of tone and intonation is essential for speech recognition and language understanding. However, most approaches to this recognition task have relied upon extensive collections of manually tagged data obtained at substantial time and financial cost. In this paper, we explore unsupervised clustering approaches to recognize pitch accent in English and tones in Mandarin Chinese. In unsupervised Mandarin tone clustering experiments, we achieve 57-87% accuracy on materials ranging from broadcast news to clean lab speech. For English pitch accent in broadcast news materials, results reach 78%. These results indicate that the intrinsic structure of tone and pitch accent acoustics can be exploited to reduce the need for costly lab...
The automatic identification of prosodic events such as pitch accent in English has long been a topic...
In English and Dutch, listeners entrain to prosodic contours to predict where focus will fall in an ...
This study investigated identification of fragmented Mandarin tones by non-native listeners. Monosyl...
Abstract: Tone recognition is the core function in Chinese speech perception. The tone perception ab...
In this paper, the ability of human listeners to recognize tones from continuous Mandarin Chinese is...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
How to model Chinese tones and integrate them into an HMM-based recognizer for Chinese recognition h...
A deep neural network (DNN) based classifier achieved 27.38% frame error rate (FER) and 15.62% seg...
Reliable pitch detection is important in Chinese speech recognition since Chinese is a tonal languag...
A significant cost in obtaining acoustic training data is the generation of accurate transcriptions....
Languages can use a common repertoire of vocal sounds to signify distinct meanings. In tonal languag...
In a previous study we have found that non-tone language speakers are able to form lexical tone cate...
A deep neural network (DNN) classifier based only on 40 mel-frequency cepstral coefficients (MFCCs) ...
This paper reports on a study of 35 Mandarin Chinese learners who (1) created pitch curves of their ...
As speech recognition systems are used in ever more applications, it is crucial for the systems to b...