Topic Modeling is a statistical process, which derives the latent themes from extensive collections of text. Three approaches to topic modeling exist, namely, unsupervised, semi-supervised and supervised. In this work, we develop a supervised topic model for an Amharic corpus. We also investigate the effect of stemming on topic detection on Term Frequency Inverse Document Frequency (TF-IDF) features, Latent Dirichlet Allocation (LDA) features and a combination of these two feature sets using four supervised machine learning tools, that is, Support Vector Machine (SVM), Naive Bayesian (NB), Logistic Regression (LR), and Neural Nets (NN). We evaluate our approach using an Amharic corpus of 14,751 documents of ten topic categories. Both qualit...
During the past few years, the construction of digitalized content is rapidly increasing, raising th...
A Topic Model is a class of generative probabilistic models which has gained widespread use in compu...
The availability of different pre-trained semantic models has enabled the quick development of machi...
Feature selection is one of the famous solutions to reduce high dimensionality problem of text categ...
This paper is in the field of natural language processing. It applied unsupervised machine learning ...
International audienceTopic Identification is one of the important keysfor the success of many appli...
This paper deals with the problem of automatic theme identification of noisy Arabic texts. Actually,...
The goal of this work has been to investigate how well a high-level task like text classification ca...
The goal of topic detection or topic modelling is to uncover the hidden topics in a large corpus. It...
It is estimated that the world’s data will increase to roughly 160 billion terabytes by 2025, with m...
The study is on classification of Amharic news automatically using neural networks approach. Learnin...
This paper focuses on the topic identification for the Arabic language based on topic models. We stu...
Topic modeling algorithms, such as LDA, find topics, hidden structures, in document corpora in an un...
In this paper, our focus is the problem of automatic prediction of Parts of Speech of words in Amhar...
Unsupervised statistical analysis of unstructured data has gained wide acceptance especially in natu...
During the past few years, the construction of digitalized content is rapidly increasing, raising th...
A Topic Model is a class of generative probabilistic models which has gained widespread use in compu...
The availability of different pre-trained semantic models has enabled the quick development of machi...
Feature selection is one of the famous solutions to reduce high dimensionality problem of text categ...
This paper is in the field of natural language processing. It applied unsupervised machine learning ...
International audienceTopic Identification is one of the important keysfor the success of many appli...
This paper deals with the problem of automatic theme identification of noisy Arabic texts. Actually,...
The goal of this work has been to investigate how well a high-level task like text classification ca...
The goal of topic detection or topic modelling is to uncover the hidden topics in a large corpus. It...
It is estimated that the world’s data will increase to roughly 160 billion terabytes by 2025, with m...
The study is on classification of Amharic news automatically using neural networks approach. Learnin...
This paper focuses on the topic identification for the Arabic language based on topic models. We stu...
Topic modeling algorithms, such as LDA, find topics, hidden structures, in document corpora in an un...
In this paper, our focus is the problem of automatic prediction of Parts of Speech of words in Amhar...
Unsupervised statistical analysis of unstructured data has gained wide acceptance especially in natu...
During the past few years, the construction of digitalized content is rapidly increasing, raising th...
A Topic Model is a class of generative probabilistic models which has gained widespread use in compu...
The availability of different pre-trained semantic models has enabled the quick development of machi...