Web pages are becoming more complex than ever, as they are generated by Content Management Systems (CMS). Thus, analyzing them, i.e. automatically identifying and classifying different elements from Web pages, such as main content, menus, among others, becomes difficult. A solution to this issue is provided by Web page segmentation which refers to the process of dividing a Web page into visually and semantically coherent segments called blocks.The quality of a Web page segmenter is measured by its correctness and its genericity, i.e. the variety of Web page types it is able to segment. Our research focuses on enhancing this quality and measuring it in a fair and accurate way. We first propose a conceptual model for segmentation, as well as ...
With the regular development of the internet, the accessibility of web sites to every one is essenti...
Web page segmentation into logical blocks is an important preprocessing step for recognizing informa...
The FitLayout library offers a suite of implemented web page segmentation algorithms along with a nu...
Web pages are becoming more complex than ever, as they are generated by Content Management Systems (...
Les pages web sont devenues plus complexes que jamais, principalement parce qu'elles sont générées p...
International audienceIn this paper, we present a framework for evaluating segmentation algorithms f...
International audienceIn this paper we describe Block-o-Matic, a web page segmentation framework. It...
International audienceWeb archives are not exempt of format obsolescence. In the near future Web pag...
We compare three known semantic web page segmentation algorithms, each serving as an example of a pa...
This thesis focuses on segmentation methods. It discusses them at a theoretical level, describes the...
<p>Web pages are typically designed for visual interaction. In order to support visual interaction t...
We describe a new approach for the automatic and objective evaluation of page segmentation (zoning) ...
La segmentation de page est l'une des étapes les plus importantes de l'analyse d'images de documents...
Document page segmentation is one of the most crucial steps in document image analysis. It ideally a...
This report deals with segmentation of web pages, which is important discipline of information extra...
With the regular development of the internet, the accessibility of web sites to every one is essenti...
Web page segmentation into logical blocks is an important preprocessing step for recognizing informa...
The FitLayout library offers a suite of implemented web page segmentation algorithms along with a nu...
Web pages are becoming more complex than ever, as they are generated by Content Management Systems (...
Les pages web sont devenues plus complexes que jamais, principalement parce qu'elles sont générées p...
International audienceIn this paper, we present a framework for evaluating segmentation algorithms f...
International audienceIn this paper we describe Block-o-Matic, a web page segmentation framework. It...
International audienceWeb archives are not exempt of format obsolescence. In the near future Web pag...
We compare three known semantic web page segmentation algorithms, each serving as an example of a pa...
This thesis focuses on segmentation methods. It discusses them at a theoretical level, describes the...
<p>Web pages are typically designed for visual interaction. In order to support visual interaction t...
We describe a new approach for the automatic and objective evaluation of page segmentation (zoning) ...
La segmentation de page est l'une des étapes les plus importantes de l'analyse d'images de documents...
Document page segmentation is one of the most crucial steps in document image analysis. It ideally a...
This report deals with segmentation of web pages, which is important discipline of information extra...
With the regular development of the internet, the accessibility of web sites to every one is essenti...
Web page segmentation into logical blocks is an important preprocessing step for recognizing informa...
The FitLayout library offers a suite of implemented web page segmentation algorithms along with a nu...