[[abstract]]自動語音識別運用在許多交通工具的人機互動和人車互動應用上。但為了有效的使用,我們必需先找到一種方法來判斷我們是否正準備使用此功能。因此,語音活動檢測是語音識別的關鍵步驟。我們利用脣形動作來驅動語音檢測的功能。 基於脣形啟動之語音偵測系統是使用臉部檢測和臉部識別來找到脣部的位置,並且運用影像處理來設定四個嘴脣的特徵點。使系統可以利用這四個特徵點的距離來決定使用者是否要開啟語音辨識的功能。[[abstract]]Automatic speech recognition is used in many applications for human-computer interaction and human vehicle interaction on mobile and vehicular platforms. To apply this function, we need to find a way to detect whether we are using it or not. So voice activity detection is the essential step in speech recognition. We use lip-motion-activated speech detection to achieve the function. This lip-motion-activated speech detection system uses face detection and facial recognition to find the location of lip, and defines four feature ...
In this paper, we propose a corpus-based lip-sync algorithm for natural face animation. Audio-visual...
Speech has information more than text, but under noisy environment speech sufferance from disadvanta...
ABSTRACT We present the development of a modular system for flexible human-computer interaction via ...
In recent research efforts, the integration of visual cues into speech analysis systems has been pro...
O movimento dos lábios é um recurso visual relevante para a detecção da atividade de voz do locutor ...
International audienceThis paper presents a quantitative and comprehensive study of the lip movement...
While they might not even notice it. humans use their eyes when they are understanding speech. Espec...
This paper proposes a lip reading technique for speech recognition by using motion estimation analys...
This paper proposes a lip reading technique for speech recognition by using motion estimation analys...
The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations...
This paper proposes a lip reading technique for speech recognition by using motion estimation analys...
When combined with acoustical speech information, visual speech information (lip movement) significa...
Speech has information more than text, but under noisy environment speech sufferance from disadvanta...
By identifying lip movements and characterizing their associations with speech sounds, the performan...
643-650Recognition of Lip movements has become one of the most challenging tasks and has crucial app...
In this paper, we propose a corpus-based lip-sync algorithm for natural face animation. Audio-visual...
Speech has information more than text, but under noisy environment speech sufferance from disadvanta...
ABSTRACT We present the development of a modular system for flexible human-computer interaction via ...
In recent research efforts, the integration of visual cues into speech analysis systems has been pro...
O movimento dos lábios é um recurso visual relevante para a detecção da atividade de voz do locutor ...
International audienceThis paper presents a quantitative and comprehensive study of the lip movement...
While they might not even notice it. humans use their eyes when they are understanding speech. Espec...
This paper proposes a lip reading technique for speech recognition by using motion estimation analys...
This paper proposes a lip reading technique for speech recognition by using motion estimation analys...
The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations...
This paper proposes a lip reading technique for speech recognition by using motion estimation analys...
When combined with acoustical speech information, visual speech information (lip movement) significa...
Speech has information more than text, but under noisy environment speech sufferance from disadvanta...
By identifying lip movements and characterizing their associations with speech sounds, the performan...
643-650Recognition of Lip movements has become one of the most challenging tasks and has crucial app...
In this paper, we propose a corpus-based lip-sync algorithm for natural face animation. Audio-visual...
Speech has information more than text, but under noisy environment speech sufferance from disadvanta...
ABSTRACT We present the development of a modular system for flexible human-computer interaction via ...