Corpus Analysis for Indexing: when corpus-based terminology makes a difference

  • Débora Oliveira
  • Luís Sarmento
  • Belinda Maia
  • Diana Santos
ORKG logo View in ORKG
Publication date
January 2005

Abstract

This paper describes preliminary work in corpus-based indexing of a sizeable specialized Web portal, with (comparable, but not parallel) information in Portuguese and English. The interdisciplinary work involved illustrates the urgent need to create greater cooperation between information retrieval and corpus-based terminology. The aim of the work described is twofold: to provide a case for terminology-based search engine deployment while improving the availability of the information in a specific Website, and to suggest further measures in order to successfully marry IR and corpus-based terminology. After presenting the Corpógrafo, a fairly mature Web-based environment for terminology work which includes information extraction capabilities...

Extracted data

We use cookies to provide a better user experience.