The document clustering process groups the unstructured text documents into a predefined set of clusters in order to provide more information to the users. There are many studies conducted in clustering monolingual documents. With the enrichment of current technologies, the study of bilingual clustering would not be a problem. However clustering bilingual document is still facing the same problem faced by a monolingual document clustering which is the “curse of dimensionality”. Hence, this encourages the study of term reduction technique in clustering bilingual documents. The objective in this study is to study the effects of reducing terms considered in clustering bilingual corpus in parallel for English and Malay documents. In this study,...
Information retrieval tasks on certain Asian languages have the problem of limited knowledge resourc...
Few studies on text clustering for the Malay language have been conducted due to some limitations th...
EMNLP, Conference on Empirical Methods in Natural Language Processing , Doha, QAT, 25-/10/2014 - 29/...
Document clustering is a process that groups a set of documents based on their similarities. There ...
In this project, we report on our work on applying Hierarchical Agglomerative Clustering (HAC) to a ...
Bilingual corpora, containing the same documents in two different languages, are becoming an essenti...
Multi Multilingual corpora, containing the same documents in a variety of languages, are becoming an...
The generation of texts are dramatically increased in this era. A text basically consists of structu...
Unlabeled research articles published in Malay language are becoming increas ingly common and avail...
AbstractWe propose an evolutionary approach based on genetic algorithm for text document clustering....
Clustering is a technique for grouping objects by their similarity. Document clustering is used for ...
Extended Word Similarity Based (EWSB) Clustering is a word clustering algorithm based on the value o...
Document clustering is the process of organizing a particularelectronic corpus of documents into sub...
In this paper, a variant of a spectral clustering algorithm is proposed for bilingual word cluster...
Document categorization is a widely researched area of information retrieval. A research on Malay na...
Information retrieval tasks on certain Asian languages have the problem of limited knowledge resourc...
Few studies on text clustering for the Malay language have been conducted due to some limitations th...
EMNLP, Conference on Empirical Methods in Natural Language Processing , Doha, QAT, 25-/10/2014 - 29/...
Document clustering is a process that groups a set of documents based on their similarities. There ...
In this project, we report on our work on applying Hierarchical Agglomerative Clustering (HAC) to a ...
Bilingual corpora, containing the same documents in two different languages, are becoming an essenti...
Multi Multilingual corpora, containing the same documents in a variety of languages, are becoming an...
The generation of texts are dramatically increased in this era. A text basically consists of structu...
Unlabeled research articles published in Malay language are becoming increas ingly common and avail...
AbstractWe propose an evolutionary approach based on genetic algorithm for text document clustering....
Clustering is a technique for grouping objects by their similarity. Document clustering is used for ...
Extended Word Similarity Based (EWSB) Clustering is a word clustering algorithm based on the value o...
Document clustering is the process of organizing a particularelectronic corpus of documents into sub...
In this paper, a variant of a spectral clustering algorithm is proposed for bilingual word cluster...
Document categorization is a widely researched area of information retrieval. A research on Malay na...
Information retrieval tasks on certain Asian languages have the problem of limited knowledge resourc...
Few studies on text clustering for the Malay language have been conducted due to some limitations th...
EMNLP, Conference on Empirical Methods in Natural Language Processing , Doha, QAT, 25-/10/2014 - 29/...