The exponential growth of text documents available on the Internet has created an urgent need for accurate, fast, and general purpose text classification algorithms. However, the "bag of words" representation used for these classification methods is often unsatisfactory as it ignores relationships between important terms that do not co-occur literally. In order to deal with this problem, we integrate background knowledge - in our application: Wikipedia - into the process of classifying text documents. The experimental evaluation on Reuters news/eeds and several other corpus shows that our classificaion results with encyclopedia knowledge are much better than the baseline "bag of words" methods.http://gateway.webofknowled...
Because the rate at which documents are being generated outstrips librarians’ ability to catalog the...
Producing large language corpora is not only highly work-intensive, but also increasingly a process ...
Wikipedia is a goldmine of information. Each article describes a single concept, and together they c...
When humans approach the task of text categorization, they interpret the specific wording of the doc...
Document classification is a key task for many text min-ing applications. However, traditional text ...
Os métodos tradicionais de classificação de textos normalmente representam documentos apenas como um...
The process whereby inferences are made from textual data is broadly referred to as text mining. In ...
International audienceThis article presents a comparative study of supervised classification approac...
In traditional text clustering methods, documents are represented as “bags of words ” without consid...
Abstract. Over the years many models had been proposed for text catego-rization. One of the most wid...
Ganiz, Murat Can (Dogus Author), Akyokuş, Selim (Dogus Author) -- Full conference title: INISTA 2012...
This article presents a comparative study of supervised classification approaches applied to the aut...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
Structured knowledge representations are becoming central to the area of Information Science. Search...
© 2017 Elsevier Inc. A traditional classification approach based on keyword matching represents each...
Because the rate at which documents are being generated outstrips librarians’ ability to catalog the...
Producing large language corpora is not only highly work-intensive, but also increasingly a process ...
Wikipedia is a goldmine of information. Each article describes a single concept, and together they c...
When humans approach the task of text categorization, they interpret the specific wording of the doc...
Document classification is a key task for many text min-ing applications. However, traditional text ...
Os métodos tradicionais de classificação de textos normalmente representam documentos apenas como um...
The process whereby inferences are made from textual data is broadly referred to as text mining. In ...
International audienceThis article presents a comparative study of supervised classification approac...
In traditional text clustering methods, documents are represented as “bags of words ” without consid...
Abstract. Over the years many models had been proposed for text catego-rization. One of the most wid...
Ganiz, Murat Can (Dogus Author), Akyokuş, Selim (Dogus Author) -- Full conference title: INISTA 2012...
This article presents a comparative study of supervised classification approaches applied to the aut...
There are many opportunities to improve the interactivity of information retrieval systems beyond th...
Structured knowledge representations are becoming central to the area of Information Science. Search...
© 2017 Elsevier Inc. A traditional classification approach based on keyword matching represents each...
Because the rate at which documents are being generated outstrips librarians’ ability to catalog the...
Producing large language corpora is not only highly work-intensive, but also increasingly a process ...
Wikipedia is a goldmine of information. Each article describes a single concept, and together they c...