Large and open multiparallel corpora are a valuable resource for contrastive corpus linguists if the data is annotated and stored in a way that allows precise and flexible ad hoc searches. A linguistic query language should also support computational linguists in automated multilingual data mining. We review a broad range of approaches for linguistic query and reporting languages according to usability criteria such as expressibility, expressiveness, and efficiency. We propose an architecture that tries to strike the right balance to suit practical purposes
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic `t...
Searching corpora with linguistic questions requires both additional information encoded in the corp...
This paper starts by discussing the reasons why linguists should be interested in parallel corpora. ...
The availability of large multi-parallel corpora offers an enormous wealth of material to contrastiv...
The availability of large multi-parallel corpora offers an enormous wealth of material to contrastiv...
This paper proposes a methodology for querying linguistic data represented in different corpus forma...
The paper presents MDDQL as a query language suitable for multi-lingual conceptual querying of colle...
Recent years have seen an increased interest in and availability of many different kinds of corpora....
With the growing availability of spoken language corpora more and more data driven research in phone...
The usefulness of annotated corpora is greatly increased if there is an associated tool that can all...
Comunicació presentada a: EACL '06: Eleventh Conference of the European Chapter of the Association f...
Increasing numbers of linguists have multimodal data of relatively rare languages, richly annotated ...
We present an approach for searching and exploring translation variants of multi-word units in large...
Linguistic query systems are special purpose IR applications. As text sizes, annotation layers, and ...
Linguistic query systems are special purpose IR applications. As text sizes, annotation layers, and ...
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic `t...
Searching corpora with linguistic questions requires both additional information encoded in the corp...
This paper starts by discussing the reasons why linguists should be interested in parallel corpora. ...
The availability of large multi-parallel corpora offers an enormous wealth of material to contrastiv...
The availability of large multi-parallel corpora offers an enormous wealth of material to contrastiv...
This paper proposes a methodology for querying linguistic data represented in different corpus forma...
The paper presents MDDQL as a query language suitable for multi-lingual conceptual querying of colle...
Recent years have seen an increased interest in and availability of many different kinds of corpora....
With the growing availability of spoken language corpora more and more data driven research in phone...
The usefulness of annotated corpora is greatly increased if there is an associated tool that can all...
Comunicació presentada a: EACL '06: Eleventh Conference of the European Chapter of the Association f...
Increasing numbers of linguists have multimodal data of relatively rare languages, richly annotated ...
We present an approach for searching and exploring translation variants of multi-word units in large...
Linguistic query systems are special purpose IR applications. As text sizes, annotation layers, and ...
Linguistic query systems are special purpose IR applications. As text sizes, annotation layers, and ...
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic `t...
Searching corpora with linguistic questions requires both additional information encoded in the corp...
This paper starts by discussing the reasons why linguists should be interested in parallel corpora. ...