The newest generation of speech technology has led to a large increase in audio-visual data that are now enriched with orthographic transcripts, for example through automatic subtitling on online platforms. Research data centers and archives hold a range of new and historical data that are currently only partially transcribed and therefore only partially accessible to systematic querying. Automatic speech recognition (ASR) is one way to make these data accessible. This paper tests the usability of a state-of-the-art ASR system on two corpora of spoken German: a historical (1960s) but regionally balanced corpus, and a relatively new corpus (2012) recorded in a narrow area. We observed a regional bias of the ASR system, with higher recognition ...
We present STT4SG-350, a corpus of Swiss German speech, annotated with Standard German text at the s...
In recent decades, broadcast archives have opened up their collections with automatic speech recogni...
Audio mining systems automatically analyse large amounts of heterogeneous media files such as televi...
In this paper we describe the large-scale German broadcast corpus (GER-TV1000h) containing more than...
Spoken languages are often rich in regional accents and dialects. These local variations often pose ...
The research project “German Today” aims to determine the amount of regional variation in (near-) st...
Automatic speech recognition is a very important technique for numerous applications like automatic ...
Automatic speech recognition is a requested technique in many fields like automatic subtitling, dial...
The components of the Frisian data collection are speech and language ...
Typical broadcast material contains not only studio-recorded texts read by trained speakers, but als...
Generating accurate word-level transcripts of recorded speech for language documentation is difficul...
In this paper, we present an end-to-end solution to the development of an automatic speech recogniti...
Modern automatic speech recognition (ASR) systems are speaker independent and designed to recognize ...