Search engines usually get web pages by using links between them. With already massive and ever increasing of web pages, they can only crawl and index a portion of the whole web pages. A model to evaluate their information coverage percentages is presented. We analyze main factors why crawlers can't cover all web information, and put up three kinds of benchmarks to measure the coverage of a search engine. The paper gives out an evaluation model for two of three benchmarks as follows: First, sampling WWW to get many web pages, which are used to check the coverage percentage of quantity through generating random IPs or breadth first search. Second, selecting high-qualified pages as samples of important pages, by HITS or PageRank algorith...
One of the determining factors of the quality of Web search engines is the size of their index. In a...
The biggest information system of World Wide Web indexing is critical to estimate. Web is the benefi...
The purpose of this thesis is to analyse studies that evaluate Web search engines. This is done in f...
Search engines are an important tool for information foraging on the web. The broad details of how ...
Recent research has studied how to measure the size of a search engine, in terms of the number of pa...
Recent research has studied how to measure the size of a search engine, in terms of the number of pa...
This is an accepted manuscript of an article published by MCB UP Ltd in Journal of Documentation on...
There is an increasing amount of academic and other information on the web [1]. There is also an inc...
Search engines are an important tool for information foraging on the web. The broad details of how t...
Recent research has studied how to measure the size of a search engine, in terms of the number of pa...
Crawling algorithms have been the subject of extensive research and optimizations, but some importan...
With increasing amount of data in deep web sources (hidden from general search engines behind web fo...
Nowadays people use web search engines to find information. Even though these engines endeavour to p...
Contains fulltext : 142387.pdf (publisher's version ) (Open Access)One of the dete...
Search engines help the user to surf the web. Due to the vast number of web pages it is highly impos...
One of the determining factors of the quality of Web search engines is the size of their index. In a...
The biggest information system of World Wide Web indexing is critical to estimate. Web is the benefi...
The purpose of this thesis is to analyse studies that evaluate Web search engines. This is done in f...
Search engines are an important tool for information foraging on the web. The broad details of how ...
Recent research has studied how to measure the size of a search engine, in terms of the number of pa...
Recent research has studied how to measure the size of a search engine, in terms of the number of pa...
This is an accepted manuscript of an article published by MCB UP Ltd in Journal of Documentation on...
There is an increasing amount of academic and other information on the web [1]. There is also an inc...
Search engines are an important tool for information foraging on the web. The broad details of how t...
Recent research has studied how to measure the size of a search engine, in terms of the number of pa...
Crawling algorithms have been the subject of extensive research and optimizations, but some importan...
With increasing amount of data in deep web sources (hidden from general search engines behind web fo...
Nowadays people use web search engines to find information. Even though these engines endeavour to p...
Contains fulltext : 142387.pdf (publisher's version ) (Open Access)One of the dete...
Search engines help the user to surf the web. Due to the vast number of web pages it is highly impos...
One of the determining factors of the quality of Web search engines is the size of their index. In a...
The biggest information system of World Wide Web indexing is critical to estimate. Web is the benefi...
The purpose of this thesis is to analyse studies that evaluate Web search engines. This is done in f...