In this paper we investigate a number of questions relating to the identification of the domain of a term by domain classification of the document in which the term occurs. We propose and evaluate a straightforward method for domain classification of documents in 24 languages that exploits a multilingual thesaurus and Wikipedia. We investigate, and provide quantitative results on, the extent to which humans agree about the domain classification of documents and terms, and the extent to which terms are likely to “inherit” the domain of their parent document.
ABSTRACT. Thesauri are used for document referencing. They define hierarchies of domains. We show ho...
The paper focuses on labelling words by subject in a non-specialized dictionary. We compare the exis...
Abstract: The task of providing dictionaries for all the world's languages is prodigious, re-q...
The work presented in this paper addresses the problem of interpretation and s...
The classification of documents is an interesting topic of recent terminological investigations, in ...
Automatically building domain-specific ontologies is a highly challenging task as it requires extrac...
We discuss an approach to the automatic expansion of domain-specific lexicons by means of term categ...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
Abstract. In this paper, we deal with the problem of analyzing and classifying web documents to sev...
Domain terms are a useful resource for tuning both resources and NLP processors to domain specific t...