Vertical search engines attempt to aggregate all available online data for a specific vertical into a normalized and structured data model. There are two common strategies for aggregating data: 1) data feeds, and 2) web crawling. Data feeds use source-specific translation rules to collect structured data, but require the source to specifically expose the data. Web crawling collects data through the same interface that users view it, which requires additional work to identify and extract the relevant content from unstructured or semi-structured text. Generalizing these tasks across many websites is difficult because each website presents content in its own arbitrary way. This thesis proposes a strategy for identifying relevant content across...
We propose and motivate a scheme for classifying queries submitted to a people search engine. We spe...
Today’s web search systems present users with heterogeneous in-formation coming from sources of diff...
Today's web search systems present users with heterogeneous information coming from sources of diffe...
Items that a user can see when he uses the general result page of a modern search engine can be cate...
Vertical search engines allow users to query for information within a subset of documents relevant t...
As the Web evolves unexpectedly fast, information grows explosively. Useful resources become more an...
Aggregating search results from a variety of heterogeneous sources, so-called verticals, such as new...
There is a growing diversity of information access applications. While general web search has been d...
The Web's dynamic,.unstructured nature makes locating resources difficult. Vertical search engines s...
The increasing volume, heterogeneity, and redundancy of the Web create a novel challenge for search ...
This paper addresses the problem of improving the relevance of a search engine results in a vertical...
A machine-based learning approach that combines web content analysis and web structure analysis was ...
As the Web evolves unexpectedly fast, information grows explosively. Useful resources become more an...
The Web is rapidly transforming from a pure document collection to the largest connected public data...
In this paper, we propose a web document ranking method using topic modeling for effective informati...
We propose and motivate a scheme for classifying queries submitted to a people search engine. We spe...
Today’s web search systems present users with heterogeneous in-formation coming from sources of diff...
Today's web search systems present users with heterogeneous information coming from sources of diffe...
Items that a user can see when he uses the general result page of a modern search engine can be cate...
Vertical search engines allow users to query for information within a subset of documents relevant t...
As the Web evolves unexpectedly fast, information grows explosively. Useful resources become more an...
Aggregating search results from a variety of heterogeneous sources, so-called verticals, such as new...
There is a growing diversity of information access applications. While general web search has been d...
The Web's dynamic,.unstructured nature makes locating resources difficult. Vertical search engines s...
The increasing volume, heterogeneity, and redundancy of the Web create a novel challenge for search ...
This paper addresses the problem of improving the relevance of a search engine results in a vertical...
A machine-based learning approach that combines web content analysis and web structure analysis was ...
As the Web evolves unexpectedly fast, information grows explosively. Useful resources become more an...
The Web is rapidly transforming from a pure document collection to the largest connected public data...
In this paper, we propose a web document ranking method using topic modeling for effective informati...
We propose and motivate a scheme for classifying queries submitted to a people search engine. We spe...
Today’s web search systems present users with heterogeneous in-formation coming from sources of diff...
Today's web search systems present users with heterogeneous information coming from sources of diffe...