A common task in both Webmetrics and Web information retrieval is to identify a set of Web pages or sites that are similar in content. In this paper we assess the extent to which links, colinks and couplings can be used to identify similar Web sites. As an experiment, a random sample of 500 pairs of domains from the UK academic Web were taken and human assessments of site similarity, based upon content type, were compared against ratings for the three concepts. The results show that using a combination of all three gives the highest probability of identifying similar sites, but surprisingly this was only a marginal improvement over using links alone. Another unexpected result was that high values for either colink counts or couplings were a...
Investigating the existence of relations between people is the starting point of this research. Prev...
Abstract: Measuring the semantic similarity between two words is an important component in various t...
Much has been written about the potential and pitfalls of macroscopic web-based link analysis, yet t...
Aggregates of links are of interest to information scientists in the same way as citation counts are...
Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The c...
To utilize the similarity information hidden in the Web graph, we investigate the problem of adaptiv...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
To find similar web pages to a query page on the Web, this paper introduces a novel link-based simil...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
This research-in-progress paper presents a new approach called Link Proximity Analysis (LPA) for ide...
Web hyperlink analysis has been a key topic of Webometric research. However, inlink data collection ...
Abstract. Finding pages on the web that are relevant to some user-defined criteria is a longestablis...
International audienceIn this paper we review two well-known citation methods to find relatedness be...
Abstract. This research-in-progress paper presents a new approach called Link Proximity Analysis (LP...
In this paper we investigate the effect of using clustering algorithms in the reverse engineering fi...
Investigating the existence of relations between people is the starting point of this research. Prev...
Abstract: Measuring the semantic similarity between two words is an important component in various t...
Much has been written about the potential and pitfalls of macroscopic web-based link analysis, yet t...
Aggregates of links are of interest to information scientists in the same way as citation counts are...
Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The c...
To utilize the similarity information hidden in the Web graph, we investigate the problem of adaptiv...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
To find similar web pages to a query page on the Web, this paper introduces a novel link-based simil...
The World Wide Web provides a wealth of data that can be harnessed to help improve information retri...
This research-in-progress paper presents a new approach called Link Proximity Analysis (LPA) for ide...
Web hyperlink analysis has been a key topic of Webometric research. However, inlink data collection ...
Abstract. Finding pages on the web that are relevant to some user-defined criteria is a longestablis...
International audienceIn this paper we review two well-known citation methods to find relatedness be...
Abstract. This research-in-progress paper presents a new approach called Link Proximity Analysis (LP...
In this paper we investigate the effect of using clustering algorithms in the reverse engineering fi...
Investigating the existence of relations between people is the starting point of this research. Prev...
Abstract: Measuring the semantic similarity between two words is an important component in various t...
Much has been written about the potential and pitfalls of macroscopic web-based link analysis, yet t...