Abstract: This paper investigates the application of text categorization (TC) in an eBusiness setting that exhibits a large number of target categories with relatively few training cases, applied to a real-life online tendering sys-tem. This is an experiment paper showing our experiences in dealing with a real-life application using the conventional machine learning approaches for TC, namely, the Rocchio method, TF-IDF (term frequency-inverse doc-ument frequency), WIDF (weighted inverse document frequency), and näıve Bayes. In order to make the categorization results acceptable for industrial use, we made use of the hierarchical structure of the target categories and investigated the semi-automated ranking categorization
In this paper we describe a new on-line document categorization strategy that can be integrated with...
Graduation date: 2009This paper examines how six online multiclass text classification algorithms pe...
Automatic text categorisation is a major challenge for information retrieval, information extraction...
This paper investigates the application of text categorization (TC) in an eBusiness setting that exh...
This paper investigates the application of text categoriza- tion (TC) in a setting exhibiting a larg...
More and more business opportunities are published on the Web; however, it is difficult to collect a...
More and more business opportunities are published on the Web; however, it is difficult to collect a...
Exploring digital collections to find information relevant to a user's interests is a challenging ta...
With the development of online data, text categorization has become one of the key procedures for ta...
A good text classifier is a classifier that efficiently categorizes large sets of text documents in ...
Modern information society is facing the challenge of handling massive volume of online documents, n...
A “marketplace” is an e-commerce medium where product and inventory information is provided by varyi...
This paper proposes a term weighting scheme, categorical term descriptor (CTD), for feature selectio...
Text keywords have huge variance and to bridge the gap between the country business segment which pr...
Finding the right business partner to drive innovation or acquire technology transfer is a labor and...
In this paper we describe a new on-line document categorization strategy that can be integrated with...
Graduation date: 2009This paper examines how six online multiclass text classification algorithms pe...
Automatic text categorisation is a major challenge for information retrieval, information extraction...
This paper investigates the application of text categorization (TC) in an eBusiness setting that exh...
This paper investigates the application of text categoriza- tion (TC) in a setting exhibiting a larg...
More and more business opportunities are published on the Web; however, it is difficult to collect a...
More and more business opportunities are published on the Web; however, it is difficult to collect a...
Exploring digital collections to find information relevant to a user's interests is a challenging ta...
With the development of online data, text categorization has become one of the key procedures for ta...
A good text classifier is a classifier that efficiently categorizes large sets of text documents in ...
Modern information society is facing the challenge of handling massive volume of online documents, n...
A “marketplace” is an e-commerce medium where product and inventory information is provided by varyi...
This paper proposes a term weighting scheme, categorical term descriptor (CTD), for feature selectio...
Text keywords have huge variance and to bridge the gap between the country business segment which pr...
Finding the right business partner to drive innovation or acquire technology transfer is a labor and...
In this paper we describe a new on-line document categorization strategy that can be integrated with...
Graduation date: 2009This paper examines how six online multiclass text classification algorithms pe...
Automatic text categorisation is a major challenge for information retrieval, information extraction...