Dear User, We are thrilled to introduce our latest release - the RescueSpeech audio dataset, comprising authentic German speech recordings obtained from simulated search and rescue (SAR) exercises. The dataset contains manually annotated recordings from native German speakers, which were initially captured at 44.1 kHz and later down-sampled to 16 kHz to obtain a set of mono-speaker-single channel audio recordings. In order to protect the identity of the speakers, their names have been anonymized. The RescueSpeech dataset is divided into two sets, each designed for different tasks: Automatic Speech Recognition (ASR) and Speech Enhancement. 1. For the ASR task, the dataset spans a duration of 1 hour and 36 minutes. It comprises a collectio...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
Item does not contain fulltextThe components of the Frisian data collection are speech and language ...
Dear User, We are thrilled to introduce our latest release - the RescueSpeech audio dataset, compri...
In this paper we describe the large-scale German broadcast corpus (GER-TV1000h) containing more than...
Audio mining systems automatically analyse large amounts of heterogeneous media files such as televi...
© 2017 IEEE. As part of an ongoing research into extracting mission-critical information from Search...
Automatic speech recognition (ASR) is a key element in making the dream of natural human-machine com...
This work identifies the causes for unsatisfactory reliability of contemporary systems for automatic...
The newest generation of speech technology caused a huge increase of audio-visual data nowadays bein...
The Malach Project [6] verified the possibility of using automatic speech recognition (ASR) methods ...
The need for automatic recognition and understanding of speech is emerging in tasks involving the pr...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
© 2017 IEEE. As part of an ongoing research into extracting mission-critical information from Search...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
Item does not contain fulltextThe components of the Frisian data collection are speech and language ...
Dear User, We are thrilled to introduce our latest release - the RescueSpeech audio dataset, compri...
In this paper we describe the large-scale German broadcast corpus (GER-TV1000h) containing more than...
Audio mining systems automatically analyse large amounts of heterogeneous media files such as televi...
© 2017 IEEE. As part of an ongoing research into extracting mission-critical information from Search...
Automatic speech recognition (ASR) is a key element in making the dream of natural human-machine com...
This work identifies the causes for unsatisfactory reliability of contemporary systems for automatic...
The newest generation of speech technology caused a huge increase of audio-visual data nowadays bein...
The Malach Project [6] verified the possibility of using automatic speech recognition (ASR) methods ...
The need for automatic recognition and understanding of speech is emerging in tasks involving the pr...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
© 2017 IEEE. As part of an ongoing research into extracting mission-critical information from Search...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial an...
Item does not contain fulltextThe components of the Frisian data collection are speech and language ...