We discuss work in progress in the semi-automatic generation of \emph{thematic lexicons} by means of \emph{term categorization}, a novel task employing techniques from information retrieval (IR) and machine learning (ML). Specifically, we view the generation of such lexicons as an iterative process of learning previously unknown associations between terms and \emph{themes} (i.e.\ disciplines, or fields of activity). The process is iterative, in that it generates, for each $c_{i}$ in a set $C=\{c_{1},\ldots,c_{m}\}$ of themes, a sequence $L^{i}_{0}\subseteq L^{i}_{1}\subseteq \ldots \subseteq L^{i}_{n}$ of lexicons, bootstrapping from an initial lexicon $L^{i}_{0}$ and a set of text corpora $\Theta=\{\theta_{0},\ldots,\theta_{n-1}\}$ given a...
Text categorisation is challenging, due to the complex structure with heterogeneous, changing topics...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
In this article we focus on the process of understanding terms in texts. We explain how a method for...
We discuss the automatic generation of \emph{thematic lexicons} by means of \emph{term categorizatio...
We discuss work in progress in the semi-automatic generation of thematic lexicons by means of term c...
We discuss an approach to the automatic expansion of domain-specific lexicons by means of term categ...
Text examples must be exploited in the acquisition of lexical structures. However, neither syntactic...
Automatic text categorisation is a major challenge for information retrieval, information extraction...
We discuss an approach to the automatic expansion of domain-specific lexicons, i.e., to the problem ...
Journal ArticleMany applications need a lexicon that represents semantic information but acquiring l...
Journal ArticleThis paper describes a bootstrapping algorithm called Basilisk that learns high-quali...
In this paper, we present an unsupervised hybrid text-mining approach to automatic acquisition of do...
We present a bootstrapping algorithm to create a semantic lexicon from a list of seed words and a co...
Semantic knowledge can be a great asset to natural language processing systems, but it is usually ha...
We propose a text categorization bootstrapping algorithm in which categories are described by releva...
Text categorisation is challenging, due to the complex structure with heterogeneous, changing topics...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
In this article we focus on the process of understanding terms in texts. We explain how a method for...
We discuss the automatic generation of \emph{thematic lexicons} by means of \emph{term categorizatio...
We discuss work in progress in the semi-automatic generation of thematic lexicons by means of term c...
We discuss an approach to the automatic expansion of domain-specific lexicons by means of term categ...
Text examples must be exploited in the acquisition of lexical structures. However, neither syntactic...
Automatic text categorisation is a major challenge for information retrieval, information extraction...
We discuss an approach to the automatic expansion of domain-specific lexicons, i.e., to the problem ...
Journal ArticleMany applications need a lexicon that represents semantic information but acquiring l...
Journal ArticleThis paper describes a bootstrapping algorithm called Basilisk that learns high-quali...
In this paper, we present an unsupervised hybrid text-mining approach to automatic acquisition of do...
We present a bootstrapping algorithm to create a semantic lexicon from a list of seed words and a co...
Semantic knowledge can be a great asset to natural language processing systems, but it is usually ha...
We propose a text categorization bootstrapping algorithm in which categories are described by releva...
Text categorisation is challenging, due to the complex structure with heterogeneous, changing topics...
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of ...
In this article we focus on the process of understanding terms in texts. We explain how a method for...