The aim of this study is topic identification using two methods: a new one that we have proposed, the TR-classifier, which is based on computing triggers, and the well-known k Nearest Neighbors (kNN). Performance is acceptable, particularly for the TR-classifier, even though we used reduced vocabulary sizes. For the TR-classifier, each topic is represented by a vocabulary built from the corresponding training corpus, whereas the kNN method uses a general vocabulary obtained by concatenating those used by the TR-classifier. For the evaluation task, six topics were selected to be identified: culture, religion, economy, local news, international news and sports. An Arabic corpus has been used to ...
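The vocabulary setup described in the abstract (one vocabulary per topic, and a general vocabulary for kNN formed by concatenating them) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the abstract does not say how words are selected or how documents are compared, so the sketch picks the most frequent words per topic and uses cosine similarity over raw term counts. The function names and the toy English corpus are hypothetical, and the TR-classifier itself is not implemented.

```python
from collections import Counter
import math

def topic_vocabulary(docs, size):
    """Assumed selection rule: the `size` most frequent words of one topic."""
    counts = Counter(word for doc in docs for word in doc.split())
    return [word for word, _ in counts.most_common(size)]

def general_vocabulary(topic_vocabs):
    """Concatenate the per-topic vocabularies, keeping each word once."""
    seen, merged = set(), []
    for vocab in topic_vocabs.values():
        for word in vocab:
            if word not in seen:
                seen.add(word)
                merged.append(word)
    return merged

def vectorize(doc, vocab):
    """Represent a document as raw term counts over the vocabulary."""
    counts = Counter(doc.split())
    return [counts[word] for word in vocab]

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def knn_predict(doc, train, vocab, k=3):
    """Majority vote among the k training documents most similar to `doc`."""
    query = vectorize(doc, vocab)
    ranked = sorted(train,
                    key=lambda item: cosine(query, vectorize(item[0], vocab)),
                    reverse=True)
    votes = Counter(topic for _, topic in ranked[:k])
    return votes.most_common(1)[0][0]

# Toy training corpus (hypothetical; the study itself uses an Arabic corpus).
corpus = {
    "sports": ["team wins match goal", "player scores goal match"],
    "economy": ["market prices rise bank", "bank rates market trade"],
}
topic_vocabs = {topic: topic_vocabulary(docs, size=3)
                for topic, docs in corpus.items()}
vocab = general_vocabulary(topic_vocabs)
train = [(doc, topic) for topic, docs in corpus.items() for doc in docs]

print(knn_predict("goal match team", train, vocab, k=3))
```

Keeping each per-topic vocabulary small mirrors the reduced vocabulary sizes mentioned in the abstract; the value 3 is chosen only to suit the toy data.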
In recent years, many algorithms have been proposed for document classification. Mos...
Today, text categorization is widely used in various areas, such as information retrieval, data mi...
One of the main factors that characterize a text is its content. Nowadays, the number of documents s...
This paper focuses on studying topic identification for Arabic language by usin...
Topic Identification is one of the important keys for the success of many appli...
The quantity of text information published in the Arabic language on the net requires the implementatio...
Malay is the major language used by citizens of Malaysia, Singapore and Brunei. A...
This project presents an implementation of an automatic KNN Arabic text categorizer. Six hundred Arabic...
Text classification is the task of assigning a document to one or more of pre-defined categories bas...
With the tremendous amount of electronic documents available, there is a great need to classify docu...
Text categorization (TC) is the process of labeling natural language texts with one or several categor...
In this paper we present two well-known methods for topic identification. The ...