A statistical method is proposed for domain-specific term extraction from domain comparative corpora. It takes distribution of a candidate word among domains and within a domain into account. Entropy impurity is used to measure distribution of a word among domains and within a domain. Normalization step is added into the extraction process to cope with unbalanced corpora. So it characterizes attributes of domain-specific term more precisely and more effectively than previous term extraction approaches. Domain-specific terms are applied in text classification as the feature space. Experiments show that it achieves better performance than traditional methods for feature selection
Domain terms are a useful resource for tuning both resources and NLP processors to domain specific t...
This paper presents Domain Relevance Estimation (DRE), a fully unsupervised text categorization tech...
Automatic terminology extraction can be divided into two tasks. The first task measures the Unithood...
A statistical method is proposed for domain-specific term extraction from domain comparative corpora...
In this paper, we propose a method for automatic term recognition (ATR) which uses the statistical d...
Term extraction is an essential task in domain knowledge acquisition. We propose two new measures to...
This chapter focuses on computational approaches to the automatic extraction of terms from domain sp...
Conference paperExtracting general or intermediate level terms is a relevant problem that has not re...
Term extraction is an essential tool for content-based publication analysis, and has a long history ...
Purpose A hybrid approach is presented, which combines linguistic and statistical information to sem...
Existing term extraction systems have predominantly targeted large and well-written document collect...
Some approaches to automatic terminology extraction from corpora imply the use of existing semantic ...
In this paper we present a novel approach to multi–word terminology extraction combining a well–know...
The present article explores two novel methods that integrate distributed representations with termi...
Abstract. We study the problem of extracting terms from research pa-pers, which is an important step...
Domain terms are a useful resource for tuning both resources and NLP processors to domain specific t...
This paper presents Domain Relevance Estimation (DRE), a fully unsupervised text categorization tech...
Automatic terminology extraction can be divided into two tasks. The first task measures the Unithood...
A statistical method is proposed for domain-specific term extraction from domain comparative corpora...
In this paper, we propose a method for automatic term recognition (ATR) which uses the statistical d...
Term extraction is an essential task in domain knowledge acquisition. We propose two new measures to...
This chapter focuses on computational approaches to the automatic extraction of terms from domain sp...
Conference paperExtracting general or intermediate level terms is a relevant problem that has not re...
Term extraction is an essential tool for content-based publication analysis, and has a long history ...
Purpose A hybrid approach is presented, which combines linguistic and statistical information to sem...
Existing term extraction systems have predominantly targeted large and well-written document collect...
Some approaches to automatic terminology extraction from corpora imply the use of existing semantic ...
In this paper we present a novel approach to multi–word terminology extraction combining a well–know...
The present article explores two novel methods that integrate distributed representations with termi...
Abstract. We study the problem of extracting terms from research pa-pers, which is an important step...
Domain terms are a useful resource for tuning both resources and NLP processors to domain specific t...
This paper presents Domain Relevance Estimation (DRE), a fully unsupervised text categorization tech...
Automatic terminology extraction can be divided into two tasks. The first task measures the Unithood...