In this paper we discuss the five requirements for building large publicly available corpora which geared the construction of the Lácio-Web corpora and their environments: 1) a comprehensive text typology; 2) text copyright clearance, compilation and annotation scheme; 3) a friendly and didactic interface; 4) the need to serve as support for several types of research; 5) the need to offer an array of associated tools. Also, we present the features that make Lácio-Web corpora interesting and novel as well as the limitations of this project, such as corpora size and balance, and the non-inclusion of spoken texts in the project’s reference corpus. 1
The paper describes Portuguese large-scale linguistic resources, mainly computational lexicons and g...
We present a newly available online resource for Portuguese, a new version of the Reference Corpus o...
Speech recognition systems use statistical methods based algorithms, and therefore need several trai...
We present our work in processing a Portuguese corpus and its publication online. After discussing ...
The pioneering balanced Brown Corpus launched in 1964, annotated reference corpora, such as Suzanne ...
In this paper we present the Corpógrafo, an integrated web-based environment for corpus linguistics ...
Several Language Resources (LRs) for Portuguese, developed at the Center of Linguistics of the Lisbo...
The extraordinary growth of computer applications, particularly over the last two decades, has enabl...
International audienceIn this article, we present the Brazilian Portuguese Lexicon, a new word-based...
The study reports the results of the exploration of a machine-readable corpus of Brazilian Portugues...
In this work, we present the construction process of a large Web corpus for Brazilian Portuguese, ai...
In this work, we present the construction process of a large Web corpus for Brazilian Portuguese, ai...
The study reports the results of the exploration of a machine-readable corpus of Brazilian Portugues...
We present a newly available on-line resource for Portuguese,a corpus of 310 million words, a ...
The purpose of this paper is to present an overview of Corpus Linguistics, characterizing it as an a...
The paper describes Portuguese large-scale linguistic resources, mainly computational lexicons and g...
We present a newly available online resource for Portuguese, a new version of the Reference Corpus o...
Speech recognition systems use statistical methods based algorithms, and therefore need several trai...
We present our work in processing a Portuguese corpus and its publication online. After discussing ...
The pioneering balanced Brown Corpus launched in 1964, annotated reference corpora, such as Suzanne ...
In this paper we present the Corpógrafo, an integrated web-based environment for corpus linguistics ...
Several Language Resources (LRs) for Portuguese, developed at the Center of Linguistics of the Lisbo...
The extraordinary growth of computer applications, particularly over the last two decades, has enabl...
International audienceIn this article, we present the Brazilian Portuguese Lexicon, a new word-based...
The study reports the results of the exploration of a machine-readable corpus of Brazilian Portugues...
In this work, we present the construction process of a large Web corpus for Brazilian Portuguese, ai...
In this work, we present the construction process of a large Web corpus for Brazilian Portuguese, ai...
The study reports the results of the exploration of a machine-readable corpus of Brazilian Portugues...
We present a newly available on-line resource for Portuguese,a corpus of 310 million words, a ...
The purpose of this paper is to present an overview of Corpus Linguistics, characterizing it as an a...
The paper describes Portuguese large-scale linguistic resources, mainly computational lexicons and g...
We present a newly available online resource for Portuguese, a new version of the Reference Corpus o...
Speech recognition systems use statistical methods based algorithms, and therefore need several trai...