Abstract Document clustering has many important applications in the area of data mining and information retrieval. Many existing document clustering techniques use the “bag-of-words ” model to represent the content of a document. However, this repre-sentation is only effective for grouping related documents when these documents share a large proportion of lexically equivalent terms. In other words, instances of synonymy between related documents are ignored, which can reduce the effectiveness of applica-tions using a standard full-text document representation. To address this problem, we present a new approach for clustering scientific documents, based on the utilization of citation contexts. A citation context is essentially the text surro...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
We investigate the accuracy of different similarity approaches for clustering over two million biome...
In this research in progress paper we report on preliminary results from the proposed novel uses of ...
Increasing progress in numerous research fields and information technologies, led to an increase in ...
International audienceIn this paper we focus of the clustering of citation contexts in scientific pa...
International audienceIn this paper we focus of the clustering of citation contexts in scientific pa...
Increased advancement in a variety of study subjects and information technologies, has increased the...
The constant success of the Internet made the number of text documents in electronic forms increases...
In this paper a novel method is proposed for scientific document clustering. The proposed method...
We investigate the accuracy of different similarity approaches for clustering over two million biome...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.Astrobiology is a new an...
The amount of online documents has grown tremendously in recent years that poses challenges for info...
BackgroundWe investigate the accuracy of different similarity approaches for clustering over two mil...
BackgroundWe investigate the accuracy of different similarity approaches for clustering over two mil...
Traditional techniques of document clustering do not consider the semantic relationships between wor...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
We investigate the accuracy of different similarity approaches for clustering over two million biome...
In this research in progress paper we report on preliminary results from the proposed novel uses of ...
Increasing progress in numerous research fields and information technologies, led to an increase in ...
International audienceIn this paper we focus of the clustering of citation contexts in scientific pa...
International audienceIn this paper we focus of the clustering of citation contexts in scientific pa...
Increased advancement in a variety of study subjects and information technologies, has increased the...
The constant success of the Internet made the number of text documents in electronic forms increases...
In this paper a novel method is proposed for scientific document clustering. The proposed method...
We investigate the accuracy of different similarity approaches for clustering over two million biome...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.Astrobiology is a new an...
The amount of online documents has grown tremendously in recent years that poses challenges for info...
BackgroundWe investigate the accuracy of different similarity approaches for clustering over two mil...
BackgroundWe investigate the accuracy of different similarity approaches for clustering over two mil...
Traditional techniques of document clustering do not consider the semantic relationships between wor...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
We investigate the accuracy of different similarity approaches for clustering over two million biome...
In this research in progress paper we report on preliminary results from the proposed novel uses of ...