In the past, linguistic research was typically conducted on relatively small datasets that were specifically designed for the research at hand. Whereas to date many large spoken language corpora have become available, the usefulness of these corpora is still not fully established in linguistic research. The research reported on in this paper was conducted to illustrate the potential of large multi-purpose spoken language corpora for linguistic research. The possibility was investigated of identifying phonetic regularities in different speech styles. To this end, a data-driven study was conducted with a large multi-purpose spoken language corpus comprising a manually corrected broad phonetic transcription of the data. Our results show that s...
Speech corpora play an important role in phonetic research and have been applied in daily life. Thei...
Recent advances in access to spoken-language corpora and development of speech processing tools have...
To achieve a robust system the variation seen for different speaking styles must be handled. An inve...
In the past, fundamental linguistic research was typically conducted on small data sets that were ha...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
A major hurdle in data-driven research on typology is having sufficient data in many languages to dr...
International audienceAlthough automatic analysis and computer-aided annotation tools are being deve...
The SPADE project aims to develop and apply user-friendly software for large-scale speech analysis o...
International audienceThe aim of the chapter is twofold: presenting the different types of data that...
International audienceThe aim of the chapter is twofold: presenting the different types of data that...
Defined in its broadest sense as large databases illustrating actual language use, corpora have prov...
Book chapter in A, Ludeling, M. Kytö and T. McEnery (Eds.) "Corpus Linguistics: An International Han...
INTRODUCTION Pronunciations in spontaneous, conversational speech tend to be much more variable tha...
International audienceThis paper discusses a method to detect statistically significant linguistic d...
Representing annotated spoken corpora The annotation of linguistic resources has long-standing tradi...
Speech corpora play an important role in phonetic research and have been applied in daily life. Thei...
Recent advances in access to spoken-language corpora and development of speech processing tools have...
To achieve a robust system the variation seen for different speaking styles must be handled. An inve...
In the past, fundamental linguistic research was typically conducted on small data sets that were ha...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
A major hurdle in data-driven research on typology is having sufficient data in many languages to dr...
International audienceAlthough automatic analysis and computer-aided annotation tools are being deve...
The SPADE project aims to develop and apply user-friendly software for large-scale speech analysis o...
International audienceThe aim of the chapter is twofold: presenting the different types of data that...
International audienceThe aim of the chapter is twofold: presenting the different types of data that...
Defined in its broadest sense as large databases illustrating actual language use, corpora have prov...
Book chapter in A, Ludeling, M. Kytö and T. McEnery (Eds.) "Corpus Linguistics: An International Han...
INTRODUCTION Pronunciations in spontaneous, conversational speech tend to be much more variable tha...
International audienceThis paper discusses a method to detect statistically significant linguistic d...
Representing annotated spoken corpora The annotation of linguistic resources has long-standing tradi...
Speech corpora play an important role in phonetic research and have been applied in daily life. Thei...
Recent advances in access to spoken-language corpora and development of speech processing tools have...
To achieve a robust system the variation seen for different speaking styles must be handled. An inve...