The rapid increase of web complexity and size makes web searched results far from satisfaction in many cases due to a huge amount of information returned by search engines. How to find intrinsic relationships among the web pages at a higher level to implement efficient web searched information management and retrieval is becoming a challenge problem. In this paper, we propose an approach to measure web page similarity. This approach takes hyperlink transitivity and page importance into consideration. From this new similarity measurement, an effective hierarchical web page clustering algorithm is proposed. The primary evaluations show the effectiveness of the new similarity measurement and the improvement of web page clustering. The proposed...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
User traversals on hyperlinks between Web pages can reveal semantic relationships between these page...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
This paper proposes a hyperlink-based web page similarity measurement and two matrix-based hierarchi...
Abstract. Finding pages on the web that are relevant to some user-defined criteria is a longestablis...
Effective & efficient retrieval of the required quality web pages on the web is becoming a great...
To utilize the similarity information hidden in the Web graph, we investigate the problem of adaptiv...
This paper proposes a new algorithm to measure relevance among Web pages (RWP) using a hybrid method...
Clustering is well suited for Web mining by automatically organizing Web pages into categories each ...
Abstract — Relevant information from the web can quickly be retrieved if logically similar webpages ...
In this paper we investigate the effect of using clustering algorithms in the reverse engineering fi...
In this paper, we analyze some clustering algorithms that have been widely employed in the past to s...
Clustering is currently more and more applied on hyperlinked documents, especially for web search re...
To find similar web pages to a query page on the Web, this paper introduces a novel link-based simil...
Web pages clustering is to divide different web pages into different classes according to traveling ...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
User traversals on hyperlinks between Web pages can reveal semantic relationships between these page...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
This paper proposes a hyperlink-based web page similarity measurement and two matrix-based hierarchi...
Abstract. Finding pages on the web that are relevant to some user-defined criteria is a longestablis...
Effective & efficient retrieval of the required quality web pages on the web is becoming a great...
To utilize the similarity information hidden in the Web graph, we investigate the problem of adaptiv...
This paper proposes a new algorithm to measure relevance among Web pages (RWP) using a hybrid method...
Clustering is well suited for Web mining by automatically organizing Web pages into categories each ...
Abstract — Relevant information from the web can quickly be retrieved if logically similar webpages ...
In this paper we investigate the effect of using clustering algorithms in the reverse engineering fi...
In this paper, we analyze some clustering algorithms that have been widely employed in the past to s...
Clustering is currently more and more applied on hyperlinked documents, especially for web search re...
To find similar web pages to a query page on the Web, this paper introduces a novel link-based simil...
Web pages clustering is to divide different web pages into different classes according to traveling ...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
User traversals on hyperlinks between Web pages can reveal semantic relationships between these page...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...