Many existing retrieval approaches do not take into account the content quality of the retrieved documents, although link-based measures such as PageRank are commonly used as a form of document prior. In this paper, we present a quality-biased ranking method that promotes documents containing high-quality content and penalizes low-quality documents. The quality of document content can be determined by its readability, layout, and ease of navigation, among other factors. Accordingly, instead of using a single estimate of document quality, we consider multiple content-based features that are directly integrated into a state-of-the-art retrieval method. These content-based features are easy to compute, store and retrieve, even for large...
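As a rough illustration of the idea in the abstract above, the sketch below adds a weighted sum of several content-quality features to a base relevance score before sorting. The abstract does not say how the features are integrated into the retrieval method, so the additive form, the feature names, the weights, the `bias` parameter, and the `ScoredDoc` type are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical weights for three quality features named after the factors
# mentioned in the abstract; the values are illustrative, not from the paper.
ASSUMED_WEIGHTS: Dict[str, float] = {
    "readability": 0.5,
    "layout": 0.3,
    "navigation": 0.2,
}


@dataclass
class ScoredDoc:
    doc_id: str
    relevance: float  # score from any base retrieval model (assumed given)
    # Content-quality features, assumed to be normalized to [0, 1].
    quality: Dict[str, float] = field(default_factory=dict)


def quality_biased_score(
    doc: ScoredDoc,
    weights: Dict[str, float] = ASSUMED_WEIGHTS,
    bias: float = 1.0,
) -> float:
    """Relevance plus a weighted sum of quality features (assumed form)."""
    # Missing features count as 0.0, so documents without quality
    # evidence receive no boost.
    quality = sum(w * doc.quality.get(name, 0.0) for name, w in weights.items())
    return doc.relevance + bias * quality


def rank(
    docs: List[ScoredDoc],
    weights: Dict[str, float] = ASSUMED_WEIGHTS,
    bias: float = 1.0,
) -> List[ScoredDoc]:
    """Sort documents by descending quality-biased score."""
    return sorted(
        docs,
        key=lambda d: quality_biased_score(d, weights, bias),
        reverse=True,
    )
```

With `bias=0.0` the ranking falls back to plain relevance order, which makes it easy to compare a ranking with and without the quality features.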
In the context of Web Search, clustering-based engines are emerging as an alternative for the classi...
Large-scale retrieval systems are often implemented as a cascading sequence of phases: a first filter...
Semantic annotations have to satisfy quality constraints to be useful for digital libraries, which i...
The ability to predict retrieval performance has potential applications in many important IR (Inform...
Quality information retrieval for the World Wide Web. The World Wide Web is an unregulated communicat...
Exploiting hyperlink structure. For many topics, the World Wide Web contains hundreds or thousands of...
The World Wide Web is an unregulated communication medium which exhibits very limited means of quali...
In this paper, an approach for the implementation of a quality-based Web search engine is proposed. ...
Currently, search engines rank search results using mainly link-based metrics. While usually most of ...
Query search engines are fundamental tools in locating documents related to Web surfers' interests....
Maximizing only the relevance between queries and documents will not satisfy users if they want the ...
With the growth of web data, how to estimate web page quality effectively and rapidly becomes more a...
Many online or local data sources provide powerful querying mechanisms but limited ranking capabilit...
The paper is concerned with applying learning to rank to document retrieval. Ranking SVM is a typica...