For two years, the European Telematics project MATE has worked towards facilitating the re-use of annotated spoken language data, addressing theoretical issues and implementing practical solutions that could serve as standards in the field. The resulting MATE Workbench for corpus annotation is now available as licensed open source software. This paper describes the MATE markup framework, which is proposed as a standard for the definition and representation of markup for spoken dialogue corpora, and presents early experience from use of the framework.
In this paper annotation modularity and use of annotation meta-schemes are identified as basic requi...
The paper presents best practices and results from projects dedicated to the creation of corpora of ...
Paper presented at: Workshop on Multilingual Language Resources and Interoperability, held ...
This paper describes the European MATE project and its work towards speech corpus annotation standar...
The growing commercialisation and sophistication of spoken dialogue systems has increased the need f...
The increasing variety and sophistication of spoken language dialogue systems (SLDSs) emphasises the...
The MATE workbench is a tool which aims to simplify the tasks of annotating, displaying and querying...
ADAM is a corpus of annotated spoken dialogues currently being developed as part of the Italian nat...
For the AMITIS multilingual human-computer dialogue project [1], we have developed new methods for t...
Representing annotated spoken corpora. The annotation of linguistic resources has long-standing tradi...
With the growing availability of spoken language corpora, more and more data-driven research in phone...
This paper describes the Linguistic Annotation Framework under development within ISO TC37 SC4 WG1...
Annotating corpora is of crucial importance in Corpus Linguistics. Linguistics...
Optimizing the production, maintenance and extension of lexical resources is one of the crucial aspects...
This paper describes a world-wide web based system which allows a speech data corpus developer to in...