This paper presents the development and evaluation of an automatic audio indexing system designed for a specific task: operation in the bilingual environment of the Parliament of the Canton of Valais in Switzerland, which has two official languages, German and French. As several speakers are bilingual, language changes may occur within a single speaker's speech or even within an utterance. Two audio indexing approaches are presented and compared: in the first, speech indexing is based on bilingual automatic speech recognition; in the second, language identification is applied after speaker diarization in order to select the corresponding monolingual speech recognizer for decoding. The two approaches are then combined. Speaker adaptive training is also addressed and evaluated. ...
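The second approach described above (language identification after speaker diarization, used to select a monolingual recognizer) can be illustrated with a minimal sketch. It is not the authors' implementation: the `Segment` type, the `index_by_language` function, and the callables passed to it are hypothetical names introduced here, and the language identifier and recognizers are assumed to be supplied by the caller.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Segment:
    """One speaker-homogeneous segment (hypothetical output of diarization)."""
    speaker: str   # speaker label assigned by diarization
    start: float   # start time in seconds
    end: float     # end time in seconds


def index_by_language(
    segments: List[Segment],
    identify_language: Callable[[Segment], str],
    recognizers: Dict[str, Callable[[Segment], str]],
) -> List[dict]:
    """Route each diarized segment to the recognizer of its detected language.

    `identify_language` and the values of `recognizers` are placeholders
    (assumptions) for real language identification and monolingual ASR.
    """
    index = []
    for seg in segments:
        lang = identify_language(seg)   # e.g. "fr" or "de"
        decode = recognizers[lang]      # select the monolingual recognizer
        index.append(
            {
                "speaker": seg.speaker,
                "start": seg.start,
                "end": seg.end,
                "language": lang,
                "text": decode(seg),
            }
        )
    return index
```

In this sketch one language decision is made per segment, so a language change inside a segment would not be captured; that is presumably the case the bilingual recognizer of the first approach is meant to cover.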
Abstract. This paper describes a designed and implemented system for efficient storage, indexing and...
This paper describes a Broadcast News (BN) database called Transcrigal-DB. The news shows are mainly...
This paper presents a system to identify the spoken language in challenging audio material such as b...
One rapidly expanding application area for state-of-the-art speech recognition technology is the au...
MediaParl is a Swiss accented bilingual database containing recordings in both French and German as ...
This paper presents an overview of aud...
This paper describes the main features of KALAKA-3, a speech database specifically designed for the ...
This chapter will focus on the automatic extraction of information from the speech in multimedia doc...
We present two concepts for systems with language identification in the context of multilingual info...
The Gong system has been developed for web based communication. It supports synchronous and asynchro...
This paper describes the setting up of a resource database for research and evaluation in the domain...
Automatic spoken Language Identification (LID) is the process of identifying the language spoken with...
In order to help journalists investigate inside large audiovisual archives, as maintained by news br...
This paper presents an overview of audio indexing, which has emerged very recently as a research top...
The newest generation of speech technology caused a huge increase of audio-visual data nowadays bein...