This paper describes past, ongoing and planned work on the collection and transcription of spoken language samples for all the South African official languages and as part of this the training of researchers in corpus linguistic research skills. More specifically the work has involved (and still involves) establishing an international corpus linguistic network linked to a network hub at a UNISA website and the development of research tools, a corpus research guide and workbook for multimodal communication and spoken language corpus research. As an example of the work we are doing and hope to do more of in the future, we present a small pilot study of the influence of English and Afrikaans on the 100 most frequent words in spoken Xhosa as th...
There are currently two distinct but not necessarily mutually exclusive approaches to the retrieval ...
The trends emerging in the natural language processing (NLP) of African languages spoken in South Af...
We present a comparison between four South African languages, based on annotated speech databases ga...
This paper describes past, ongoing and planned work on the collection and transcription of spoken la...
In this paper we give an outline of a corpus planning project which aims to develop linguistic resou...
In this contribution, the design, collection, annotation and planned distribution of a new spoken la...
In this contribution, the design, collection, annotation and planned distribution of a new spoken la...
This work was supported by the Department of Arts and Culture.The NCHLT speech corpus contains wide-...
This paper describes the proposed structure and design for a corpus of Xhosa English, which should u...
The development of linguistic resources for use in natural language processing is of utmost importan...
Abstract: Within the last twenty years, the use of a corpus for language research has become the si...
The advent of electronic corpora has revolutionized linguistic investigation internationally and is ...
We present a corpus-based analysis of the Afrikaans, English, Xhosa and Zulu languages, comparing th...
The point of departure of the present article is the realisation that more and more serious contempo...
<p>Abstract: Within the last twenty years, the use of a corpus for language research has becom...
There are currently two distinct but not necessarily mutually exclusive approaches to the retrieval ...
The trends emerging in the natural language processing (NLP) of African languages spoken in South Af...
We present a comparison between four South African languages, based on annotated speech databases ga...
This paper describes past, ongoing and planned work on the collection and transcription of spoken la...
In this paper we give an outline of a corpus planning project which aims to develop linguistic resou...
In this contribution, the design, collection, annotation and planned distribution of a new spoken la...
In this contribution, the design, collection, annotation and planned distribution of a new spoken la...
This work was supported by the Department of Arts and Culture.The NCHLT speech corpus contains wide-...
This paper describes the proposed structure and design for a corpus of Xhosa English, which should u...
The development of linguistic resources for use in natural language processing is of utmost importan...
Abstract: Within the last twenty years, the use of a corpus for language research has become the si...
The advent of electronic corpora has revolutionized linguistic investigation internationally and is ...
We present a corpus-based analysis of the Afrikaans, English, Xhosa and Zulu languages, comparing th...
The point of departure of the present article is the realisation that more and more serious contempo...
<p>Abstract: Within the last twenty years, the use of a corpus for language research has becom...
There are currently two distinct but not necessarily mutually exclusive approaches to the retrieval ...
The trends emerging in the natural language processing (NLP) of African languages spoken in South Af...
We present a comparison between four South African languages, based on annotated speech databases ga...