This paper describes the development of a multilingual and multigenre manually annotated speech dataset, freely available to the research community as ground truth for the evaluation of automatic transcription systems and spoken language translation systems. The dataset includes two video genres—television broadcast news and talk-shows—and covers Flemish, English, German, and Italian, for a total of about 35 h of television speech. Besides segmentation and orthographic transcription, we added a very rich annotation on the audio signal, both at the linguistic level (e.g. filled pauses, pronunciation errors, disfluencies, speech in a foreign language) and at the acoustic level (e.g. background noise and different types of non-speech events). ...
One rapidly expanding application area for state-of-the-art speech recognition technology is the au...
The paper describes recent progress in the development the Slovak language models for transcription ...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
We collect and release CrowdSpeech — the first publicly available large-scale dataset of crowdsource...
A system and method are disclosed to train speech transcription models via crowdsourcing. Users of a...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
International audienceThis paper reports on a speech-to-text (STT) transcription system for Hungaria...
Audio captioning is a novel field of multi-modal translation and it is the task of creating a textua...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
There is an enormous amount of recorded speech generated daily, and quickly transcribing and analyzi...
This paper presents some recent improvements in automatic transcription of Italian broadcast news ob...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
One rapidly expanding application area for state-of-the-art speech recognition technology is the au...
The paper describes recent progress in the development the Slovak language models for transcription ...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...
This paper describes the development of a multilingual and multigenre manually annotated speech data...
This paper presents the results of an experimental study conducted with the aim of comparing two met...
We collect and release CrowdSpeech — the first publicly available large-scale dataset of crowdsource...
A system and method are disclosed to train speech transcription models via crowdsourcing. Users of a...
Crowdsourcing can be defined as the purchase of data (labels, speech recordings, etc.), usually on l...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
International audienceThis paper reports on a speech-to-text (STT) transcription system for Hungaria...
Audio captioning is a novel field of multi-modal translation and it is the task of creating a textua...
This paper introduces a method to produce high-quality transcrip- tions of speech data from only two...
There is an enormous amount of recorded speech generated daily, and quickly transcribing and analyzi...
This paper presents some recent improvements in automatic transcription of Italian broadcast news ob...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
One rapidly expanding application area for state-of-the-art speech recognition technology is the au...
The paper describes recent progress in the development the Slovak language models for transcription ...
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for...