This paper describes preliminary work in corpus-based indexing of a sizeable specialized Web portal, with (comparable, but not parallel) information in Portuguese and English. The interdisciplinary work involved illustrates the urgent need to create greater cooperation between information retrieval and corpus-based terminology. The aim of the work described is twofold: to provide a case for terminology-based search engine deployment while improving the availability of the information in a specific Website, and to suggest further measures in order to successfully marry IR and corpus-based terminology. After presenting the Corpógrafo, a fairly mature Web-based environment for terminology work which includes information extraction capabilities...
One of the most relevant problems with Information Retrieval (IR) software is the correct processing...
The manual acquisition of terminological data from domain-specific text material is a very time-cons...
In this paper we compare a simple but widely used approach for multi-word indexing in two large coll...
This methodological paper demonstrates how methods from corpus linguistics – a collection of compute...
International audienceThis article presents a new approach in order to index a Web site. It uses ont...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
Purpose: Controlled vocabularies play an important role in information retrieval. Numerous studies h...
The role of the Web for text corpus construction is becoming increasingly significant. However, the ...
Abstract. In corpus-based lexicography and natural language processing fields some authors have prop...
Abstract. Multilingual Information Retrieval usually forces a choice between free text indexing or i...
<div><p>In this article, we present the Brazilian Portuguese Lexicon, a new word-based corpus for ps...
In this article, we present the Brazilian Portuguese Lexicon, a new word-based corpus for psycholing...
When searching information in a retrieval system, people use a variety of terms to describe their in...
International audienceComprehensive terminology is essential for a community to describe, exchange, ...
In this research, our theoretical goal is to investigate what characterizes relevant documents for u...
One of the most relevant problems with Information Retrieval (IR) software is the correct processing...
The manual acquisition of terminological data from domain-specific text material is a very time-cons...
In this paper we compare a simple but widely used approach for multi-word indexing in two large coll...
This methodological paper demonstrates how methods from corpus linguistics – a collection of compute...
International audienceThis article presents a new approach in order to index a Web site. It uses ont...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
Purpose: Controlled vocabularies play an important role in information retrieval. Numerous studies h...
The role of the Web for text corpus construction is becoming increasingly significant. However, the ...
Abstract. In corpus-based lexicography and natural language processing fields some authors have prop...
Abstract. Multilingual Information Retrieval usually forces a choice between free text indexing or i...
<div><p>In this article, we present the Brazilian Portuguese Lexicon, a new word-based corpus for ps...
In this article, we present the Brazilian Portuguese Lexicon, a new word-based corpus for psycholing...
When searching information in a retrieval system, people use a variety of terms to describe their in...
International audienceComprehensive terminology is essential for a community to describe, exchange, ...
In this research, our theoretical goal is to investigate what characterizes relevant documents for u...
One of the most relevant problems with Information Retrieval (IR) software is the correct processing...
The manual acquisition of terminological data from domain-specific text material is a very time-cons...
In this paper we compare a simple but widely used approach for multi-word indexing in two large coll...