Swiss dialects of German are, unlike many dialects of other standardised languages, widely used in everyday communication. Nevertheless, automatic processing of Swiss German remains a considerable challenge because it is mostly a spoken variety and is subject to considerable regional variation. This paper presents the ArchiMob corpus, a freely available general-purpose corpus of spoken Swiss German based on oral history interviews. The corpus is the result of a long design process, intensive manual work and specially adapted computational processing. We first present how the corpus can be accessed for linguistic, historical and computational research. We then describe how the documents were transcribed, segmen...
Since 2015, the project Sprachalltag II has been running at the Institute of Historical and Cultural...
This thesis proposes to combine methods and data from two rather distant fields of language science ...
The Corpus of Austrian Dialect Recordings from the 20th Century comprises 2442 dialect recordings fr...
Swiss dialects of German are, unlike many dialects of other standardised languages, widely used in e...
Although Swiss dialects of German are widely used in everyday communication, automatic processing of...
Swiss dialects of German are, unlike most dialects of well standardised languages, widely used in ev...
Alemannische Dialektologie – Forschungsstand und Perspektiven. Sonderheft. Peer reviewed
The ArchiMob corpus represents German varieties spoken on the territory of Switzerland. It is the...
Swiss German is a dialect continuum whose dialects are very different from Standard German, the offi...
To study and automatically process Swiss German, it is necessary to resolve the issue of variation i...
In this paper, we report on recent digitization efforts of the linguistic atlas of German-speaking S...
The research project “German Today” aims to determine the amount of regional variation in (near-)sta...
Most Natural Language Processing (NLP) applications focus on standardized, written language varietie...