Multi Multilingual corpora, containing the same documents in a variety of languages, are becoming an essential resource for natural language processing. Clustering multilingual corpora provides us with an insight into the differences between languages when term frequency-based Information Retrieval (IR) tools are used. It also allows one to use the Natural Language Processing (NLP) and IR tools in one language to implement IR for another language. For instance, in this way, the most relevant articles to be translated from language Malay to language English can be selected after studying the clusters of abstracts in language English. In this paper, we report on our work on applying Hierarchical Agglomerative Clustering (HAC) to a large corpu...
This chapter describes a novel multistage method for linguistic clustering of large collections of t...
Document retrieval process stored in document database often produces very large numbers of document...
EMNLP, Conference on Empirical Methods in Natural Language Processing , Doha, QAT, 25-/10/2014 - 29/...
Bilingual corpora, containing the same documents in two different languages, are becoming an essenti...
In this project, we report on our work on applying Hierarchical Agglomerative Clustering (HAC) to a ...
The document clustering process groups the unstructured text documents into a predefined set of clus...
In this article, we report on our work on applying hierarchical agglomerative clustering (HAC) to a ...
Document clustering is a process that groups a set of documents based on their similarities. There ...
With the development of statistical machine translation, we have ready-to-use tools that can transla...
Unlabeled research articles published in Malay language are becoming increas ingly common and avail...
In this paper, a variant of a spectral clustering algorithm is proposed for bilingual word cluster...
Recognition of Named Entities (NEs) is a dif-ficult process in Indian languages like Hindi, Telugu, ...
In this study a clustering technique has been implemented which is K-Means like with hierarchical in...
Scatter/Gather systems are increasingly becoming useful in browsing document corpora. Usability of t...
Lexicostatistic and language similarity clusters are useful for computational linguistic researches ...
This chapter describes a novel multistage method for linguistic clustering of large collections of t...
Document retrieval process stored in document database often produces very large numbers of document...
EMNLP, Conference on Empirical Methods in Natural Language Processing , Doha, QAT, 25-/10/2014 - 29/...
Bilingual corpora, containing the same documents in two different languages, are becoming an essenti...
In this project, we report on our work on applying Hierarchical Agglomerative Clustering (HAC) to a ...
The document clustering process groups the unstructured text documents into a predefined set of clus...
In this article, we report on our work on applying hierarchical agglomerative clustering (HAC) to a ...
Document clustering is a process that groups a set of documents based on their similarities. There ...
With the development of statistical machine translation, we have ready-to-use tools that can transla...
Unlabeled research articles published in Malay language are becoming increas ingly common and avail...
In this paper, a variant of a spectral clustering algorithm is proposed for bilingual word cluster...
Recognition of Named Entities (NEs) is a dif-ficult process in Indian languages like Hindi, Telugu, ...
In this study a clustering technique has been implemented which is K-Means like with hierarchical in...
Scatter/Gather systems are increasingly becoming useful in browsing document corpora. Usability of t...
Lexicostatistic and language similarity clusters are useful for computational linguistic researches ...
This chapter describes a novel multistage method for linguistic clustering of large collections of t...
Document retrieval process stored in document database often produces very large numbers of document...
EMNLP, Conference on Empirical Methods in Natural Language Processing , Doha, QAT, 25-/10/2014 - 29/...