Multilingual speech recognition involves numerous research challenges, including common phoneme sets, adaptation on limited amounts of training data, and mixed-language recognition (common in many countries, such as Switzerland). In the latter case, one cannot even assume that the language being spoken is known in advance. This is the context and motivation of the present work. We investigate how current state-of-the-art speech recognition systems can be exploited in multilingual environments, where the language (from an assumed set of five possible languages, in our case) is not known a priori during recognition. We combine monolingual systems and extensively develop and compare different features and acous...
In recent years, the features derived from posteriors of a multilayer perceptron (MLP), known as tan...
The field of speaker and language recognition is constantly being researched and developed, but much...
Automatic speech recognition systems have so far been developed only for very few languages out of t...
Automatic speech recognition requires many hours of transcribed speech recordings in order for an ac...
The process of determining the language of a speech utterance is called Language Identification (LID...
In this paper, phoneme sequences are used as language information to perform code-switched...
End-to-end trainable deep neural networks have become the state-of-the-art architecture for automati...
This paper describes our work in developing multilingual (Swedish and English) speech recognition sy...
This paper presents a new approach to estimate "universal" phoneme posterior probabilities for mixed...
This paper describes our work in developing multilingual (Swedish and English) speech recognition sy...
We present two concepts for systems with language identification in the context of multilingual info...
This paper describes our work in developing a bilingual speech recognition system using two SpeechDa...
Mediaparl is a Swiss accented bilingual database containing recordings in both French and German as ...
This thesis explores methods to rapidly bootstrap automatic speech recognition systems for languages...
Articulatory features describe the way in which the speech organs are used when producing speech sou...