A large amount of information on the web cannot be accessed by conventional crawlers. This portion of the web is usually called the hidden web. Addressing this problem requires solving two tasks: crawling the client-side hidden web and crawling the server-side hidden web. In this paper we present an architecture and a set of related techniques for accessing the information placed in the client-side hidden web, dealing with aspects such as JavaScript technology, non-standard session maintenance mechanisms, client redirections, pop-up menus, etc. Our approach leverages current browser APIs and implements novel crawling models and algorithms.
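To make the client-side problem concrete, the following minimal sketch lists the navigation actions in a page that a crawler following only plain `href` links would miss: `javascript:` links, `onclick` handlers, and meta-refresh client redirections. It is an illustration only and not the architecture described above, which relies on browser APIs to actually execute such actions; the class name `ClientSideActionFinder`, the function `find_client_side_actions`, and the sample page are all hypothetical.

```python
from html.parser import HTMLParser


class ClientSideActionFinder(HTMLParser):
    """Collects navigation actions that a link-only crawler would miss."""

    def __init__(self):
        super().__init__()
        self.actions = []  # list of (kind, value) tuples, in document order

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        attr = {name: (value or "") for name, value in attrs}

        # Links whose target is a script call rather than a URL.
        href = attr.get("href", "")
        if tag == "a" and href.lower().startswith("javascript:"):
            self.actions.append(("script_link", href[len("javascript:"):]))

        # Any element that navigates through an event handler.
        if "onclick" in attr:
            self.actions.append(("event_handler", attr["onclick"]))

        # Client redirections declared as <meta http-equiv="refresh">.
        if tag == "meta" and attr.get("http-equiv", "").lower() == "refresh":
            # The content attribute looks like "0; url=/home.html".
            _, _, target = attr.get("content", "").partition("=")
            if target:
                self.actions.append(("client_redirect", target.strip()))


def find_client_side_actions(html):
    """Return the client-side actions found in an HTML string."""
    finder = ClientSideActionFinder()
    finder.feed(html)
    return finder.actions


# Hypothetical page used only to exercise the sketch.
SAMPLE = """
<html><head><meta http-equiv="refresh" content="0; url=/home.html"></head>
<body>
<a href="javascript:openMenu('products')">Products</a>
<div onclick="loadPage(2)">Next</div>
<a href="/static.html">Plain link</a>
</body></html>
"""

if __name__ == "__main__":
    for kind, value in find_client_side_actions(SAMPLE):
        print(kind, value)
```

A static scan like this can only identify candidate actions. Resolving where `openMenu('products')` or `loadPage(2)` actually leads requires running the script in a browser-like environment, which is the gap the approach in this paper targets.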
JavaScript Client-side hidden web pages (CSHW) contain dynamic material created as a result of speci...
During the past decade, web applications have evolved substantially. Taking advantage of new technol...
Web applications have come a long way both in terms of adoption to provide information and services ...
Client-side JavaScript is increasingly used for enhancing web application functionality, interactivi...
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of web pag...
Web Crawler forms the back-bone of applications that facilitate Web information retrieval. Generic c...
Abstract- The web contains a large amount of information which is increasing by magnitude every day....
Web search engines use web crawlers that follow hyperlinks. This technique is ideal for discovering ...
In this paper, Web Crawling systems are investigated. Such systems are mostly used in Web archiving,...
The number of applications that need to crawl the Web to gather data is growing at an ever increasin...
Web application scanners are popular tools to perform black box testing and are widely used to disco...
Abstract- A web crawler is a software program that browses the web in a very systematic manner. Craw...
AJAX is a very promising approach for improving rich interactivity and responsiveness of web applica...