This article argues that documentary linguistics and corpus phonetics can form a happy marriage in that corpora extracted from language documentation collections contain highly relevant data that can advance corpus phonetics by enabling broad comparative studies. To make this point, this article reviews previous research on phonetic lengthening at utterance boundaries and pause probabilities before nouns and verbs in ten languages. I then introduce the DoReCo initiative, which, based on experience gained from these studies, builds a database of time-aligned corpora from documentary collections of 50 languages for corpus phonetic research and other research purposes
Book chapter in A, Ludeling, M. Kytö and T. McEnery (Eds.) "Corpus Linguistics: An International Han...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
This discussion note reviews responses of the linguistics profession to the grave issues of language...
This paper explores the application of quantitative methods to study the effect of various factors o...
The past two decades have seen an explosion in the quantity of documentary materials available on mi...
International audienceThis paper explores the application of quantitative methods to study the effec...
This paper explores the application of quantitative methods to study the effect of various factors o...
For decades, language documentation proponents have argued for the separability of LD as its own sub...
This paper explores the application of quantitative methods to study the effect of various factors o...
This paper explores the application of quantitative methods to study the effect of various factors o...
International audienceAs the quality and availability of corpora of lesser-documented languages grow...
We advocate for inclusion of phonetic production and perception experiments in language documentatio...
none2noA corpus is a collection of authentic, non-elicited texts selected and assembled to study lan...
I reflect the role of language documentations in linguistic research beyond its most common linguist...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
Book chapter in A, Ludeling, M. Kytö and T. McEnery (Eds.) "Corpus Linguistics: An International Han...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
This discussion note reviews responses of the linguistics profession to the grave issues of language...
This paper explores the application of quantitative methods to study the effect of various factors o...
The past two decades have seen an explosion in the quantity of documentary materials available on mi...
International audienceThis paper explores the application of quantitative methods to study the effec...
This paper explores the application of quantitative methods to study the effect of various factors o...
For decades, language documentation proponents have argued for the separability of LD as its own sub...
This paper explores the application of quantitative methods to study the effect of various factors o...
This paper explores the application of quantitative methods to study the effect of various factors o...
International audienceAs the quality and availability of corpora of lesser-documented languages grow...
We advocate for inclusion of phonetic production and perception experiments in language documentatio...
none2noA corpus is a collection of authentic, non-elicited texts selected and assembled to study lan...
I reflect the role of language documentations in linguistic research beyond its most common linguist...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
Book chapter in A, Ludeling, M. Kytö and T. McEnery (Eds.) "Corpus Linguistics: An International Han...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
This discussion note reviews responses of the linguistics profession to the grave issues of language...