Speech processing encompasses a variety of technologies that automatically process speech for some downstream processing. These technologies include identifying the language or dialect spoken, the person speaking, what is said, and how it is said. The downstream processing may be limited to a transcription or to a transcription enhanced with additional metadata, or may be used to carry out an action or interpreted within a spoken dialog system or more generally for analytics. With the availability of large spoken multimedia or multimodal data there is growing interest in using such technologies to provide structure and random access to particular segments. Automatic tools can also serve to annotate large corpora for expl...
Mediaparl is a Swiss accented bilingual database containing recordings in both French and German as ...
In this paper, we describe recent work at Idiap Research Institute in the domain of multilingual spe...
Today, speech synthesizers in new languages are typically built by collecting several hours of well ...
Enormous progress in speech technologies has been achieved over the last two de...
Spoken language processing technologies are principal components in most of the...
A comparative analysis of multi-language speech samples is conducted using acoustic characteristics ...
We present an analysis pipeline and best practice guidelines for building and curating corpora of ev...
Languages are fundamental to human communication and serve as a means to express social and cultural...
In recent years, more and more speech processing products in several languages have been widely dist...
Machine learning has revolutionised speech technologies for major world languages, but these techno...
This article discusses the role of language development in early acquisition, multilingualism, and s...
Advances in human language technology offer the promise of pervasive access to on-line infor...
Machine learning has revolutionized speech technologies for major world languages, but these technol...
The processes by which listeners recognize spoken language are highly language-specific. Listeners ...
Only a handful of the world’s languages are abundant with the resources that enable practical applic...