With the goal of harvesting all information about a given entity, in this paper, we try to harvest all matching documents for a given query submitted on a search engine. The objective is to retrieve all information about for instance "Michael Jackson", "Islamic State", or "FC Barcelona" from indexed data in search engines, or hidden data behind web forms, using a minimum number of queries. Policies of web search engines usually do not allow accessing all of the matching query search results for a given query. They limit the number of returned documents and the number of user requests. These limitations are also applied in deep web sources, for instance in social networks like Twitter. In this work, we propose a new approach which automatica...
The Web has been rapidly ``deepened" by massive databases online: Recent surveys show that while the...
Information extraction (IE) systems discover structured in-formation from natural language text, to ...
To store the information in a database is one of the major tasks. The efficient storage of data is i...
With the goal of harvesting all information about a given entity, in this paper, we try to harvest a...
In this paper, the goal is harvesting all documents matching a given (entity) query from a deep web ...
In this paper, the goal is harvesting all documents matching a given (entity) query from a deep web ...
In this thesis, we investigate the path towards a focused web harvesting approach which can automati...
The change of the web content is rapid. In Focused Web Harvesting [?], which aims at achieving a com...
As large amount of information is growing in web daily, lots of relevant data are available in the f...
Accessing information is an essential factor in decision making processes occurring in different dom...
Search results generated by searchable databases are served dynamically and far larger than the stat...
Abstract- The web contains a large amount of information which is increasing by magnitude every day....
Categories and Subject Descriptors: H.3.4 [Systems and Software]: Performance evaluation (efficiency...
Search engines are the main hub of information in the Web. They crawl and index Web contents to allo...
The Web is rapidly transforming from a pure document collection to the largest connected public data...
The Web has been rapidly ``deepened" by massive databases online: Recent surveys show that while the...
Information extraction (IE) systems discover structured in-formation from natural language text, to ...
To store the information in a database is one of the major tasks. The efficient storage of data is i...
With the goal of harvesting all information about a given entity, in this paper, we try to harvest a...
In this paper, the goal is harvesting all documents matching a given (entity) query from a deep web ...
In this paper, the goal is harvesting all documents matching a given (entity) query from a deep web ...
In this thesis, we investigate the path towards a focused web harvesting approach which can automati...
The change of the web content is rapid. In Focused Web Harvesting [?], which aims at achieving a com...
As large amount of information is growing in web daily, lots of relevant data are available in the f...
Accessing information is an essential factor in decision making processes occurring in different dom...
Search results generated by searchable databases are served dynamically and far larger than the stat...
Abstract- The web contains a large amount of information which is increasing by magnitude every day....
Categories and Subject Descriptors: H.3.4 [Systems and Software]: Performance evaluation (efficiency...
Search engines are the main hub of information in the Web. They crawl and index Web contents to allo...
The Web is rapidly transforming from a pure document collection to the largest connected public data...
The Web has been rapidly ``deepened" by massive databases online: Recent surveys show that while the...
Information extraction (IE) systems discover structured in-formation from natural language text, to ...
To store the information in a database is one of the major tasks. The efficient storage of data is i...