International audienceWe present in this paper a clustering algorithm which is based on a cellular automata and which aims at displaying a map of web pages. We describe the main principles of methods that build such maps, and the main principles of cellular automata. We show how these principles can be applied to the problem of web pages clustering: the cells, which are organized in a 2D grid, can be either empty or may contain a page. The local transition function of cells favors the creation of groups of similar states (web pages) in neighbouring cells. We then present the visual results obtained with our method on sets of documents. These documents are thus organized into a visual map which eases the browsing of these pages
In this paper, we analyze some widely employed clustering algorithms to identify duplicated or clone...
Reverse engineering techniques have the potential to support Web site understanding, by providing vi...
In this paper, we analyze some clustering algorithms that have been widely employed in the past to s...
International audienceWe present in this paper a clustering algorithm which is based on a cellular a...
International audienceWe present in this paper a clustering algorithm which is based on a cellular a...
We present in this paper a clustering algorithm which is based on a cellular automaton and which aim...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
Several techniques have been recently proposed to automatically generate Web wrappers, i.e., program...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
Web page clustering is a focal task in Web Mining to organize the content of websites, understanding...
This report deals with segmentation of web pages, which is important discipline of information extra...
In this technique, some documents like HTML are used as entering data. The purpose of this technique...
With the growth of web-based applications and the increasedpopularity of the World Wide Web (WWW), t...
In this chapter we enhance the representation of web documents by utilizing graphs instead of vector...
[[abstract]]Nowadays most of the Web pages contain little amount of structure and supporting informa...
In this paper, we analyze some widely employed clustering algorithms to identify duplicated or clone...
Reverse engineering techniques have the potential to support Web site understanding, by providing vi...
In this paper, we analyze some clustering algorithms that have been widely employed in the past to s...
International audienceWe present in this paper a clustering algorithm which is based on a cellular a...
International audienceWe present in this paper a clustering algorithm which is based on a cellular a...
We present in this paper a clustering algorithm which is based on a cellular automaton and which aim...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
Several techniques have been recently proposed to automatically generate Web wrappers, i.e., program...
We present an approach based on Winner Takes All (WTA), a competitive clustering algorithm, to suppo...
Web page clustering is a focal task in Web Mining to organize the content of websites, understanding...
This report deals with segmentation of web pages, which is important discipline of information extra...
In this technique, some documents like HTML are used as entering data. The purpose of this technique...
With the growth of web-based applications and the increasedpopularity of the World Wide Web (WWW), t...
In this chapter we enhance the representation of web documents by utilizing graphs instead of vector...
[[abstract]]Nowadays most of the Web pages contain little amount of structure and supporting informa...
In this paper, we analyze some widely employed clustering algorithms to identify duplicated or clone...
Reverse engineering techniques have the potential to support Web site understanding, by providing vi...
In this paper, we analyze some clustering algorithms that have been widely employed in the past to s...