In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech corpus. The corpus is intended for training and testing of existing audio-visual speech recognition system. The name of the database is UWB-07-ICAVR, where ICAVR stands for Impaired Condition Audio Visual speech Recognition. The corpus consist of 10000 utterances of continuous speech obtained from 50 speakers. The total length of the database is 25 hours. Each utterance is stored as a separate sentence. The corpus extends existing databases by covering condition of variable illumination. We acquired 50 speakers, where half of them were men and half of them were women. Recording was done by two cameras and two microphones. Database introduced in...
We describe the design, recording and content of a Czech Sign Language database in this paper. The d...
This paper describes a new Slovak speech recognition dedicated corpus built from TEDx talks and Jump...
The corpus contains speech data of 2 Czech native speakers, male and female. The speech is very prec...
In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech co...
In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech co...
The paper presents the results of recent experiments with audio-visual speech recognition for two po...
Audiovisual speech recognition (AVSR) systems have been proven superior over audio-only speech recog...
Multimodal signal processing has become an important topic of research for overcoming certain proble...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
The corpus consists of transcribed recordings from the Czech political discussion broadcast “Otázky ...
A database of video sequences of a male speaker uttering 269 Swedish sentences and 153 VCV-words has...
This thesis deals with voice activity detection (VAD) and requirements for creating a speech databas...
We present a large corpus of Czech parliament plenary sessions. The corpus consists of approximately...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
Práce se zabývá rozpoznáváním řeči a tvorbou řečové databáze, která bude sloužit jako trénovací a te...
We describe the design, recording and content of a Czech Sign Language database in this paper. The d...
This paper describes a new Slovak speech recognition dedicated corpus built from TEDx talks and Jump...
The corpus contains speech data of 2 Czech native speakers, male and female. The speech is very prec...
In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech co...
In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech co...
The paper presents the results of recent experiments with audio-visual speech recognition for two po...
Audiovisual speech recognition (AVSR) systems have been proven superior over audio-only speech recog...
Multimodal signal processing has become an important topic of research for overcoming certain proble...
This thesis describes how an automatic lip reader was realized. Visual speech recognition is a preco...
The corpus consists of transcribed recordings from the Czech political discussion broadcast “Otázky ...
A database of video sequences of a male speaker uttering 269 Swedish sentences and 153 VCV-words has...
This thesis deals with voice activity detection (VAD) and requirements for creating a speech databas...
We present a large corpus of Czech parliament plenary sessions. The corpus consists of approximately...
The corpus consists of recordings from the Chamber of Deputies of the Parliament of the Czech Republ...
Práce se zabývá rozpoznáváním řeči a tvorbou řečové databáze, která bude sloužit jako trénovací a te...
We describe the design, recording and content of a Czech Sign Language database in this paper. The d...
This paper describes a new Slovak speech recognition dedicated corpus built from TEDx talks and Jump...
The corpus contains speech data of 2 Czech native speakers, male and female. The speech is very prec...