ISBN 2-906855-18-9Web resources are more and more different, not only regarding thematic content but also related to type of document, geographic origin, level, language, etc. However, web search engines do not take into account this heterogeneity and propose only a thematic access by keywords to the documents. This paper presents a method allowing to extract homogenous corpus of web documents. This method based on link analysis uses co-citation method and focuses more specially on the notion of type of web documents.Les ressources disponibles sur le Web sont de plus en plus diverses aussi bien d'un point de vue thématique, qu'au niveau de leur type, de leur origine géographique, etc. Cependant, les outils de recherche ne prennent pas en co...
The Web and its tools of orientation: how better to understand the information available on the Inte...
The use of hypertext links on the web makes sites more attractive and easier to read and allows enri...
International audienceGiven the large heterogeneity of the World Wide Web, using metadata on the sea...
ISBN 2-906855-18-9Web resources are more and more different, not only regarding thematic content but...
Web resources are more and more different, not only regarding thematic content but also related to t...
Web resources are more and more different, not only regarding thematic content but also related to t...
In this thesis, which is part and parcel of the more general context of web information retrieval, w...
Dans cette thèse, qui s'inscrit dans le contexte général de la recherche d'information sur la Toile,...
Web resources are more and more different, not only regarding thematic content but also related to t...
MultilingualWeb Document (MWD) processing has become one of the major interests of research and deve...
International audienceThe authors, who publish knowledge on the Web related to readable electronic d...
International audienceThe authors who publish knowledge on the Web in readable electronic documents ...
The Web is a huge source of information, and one of the main problems facing users is finding docume...
We describe a real experiment in order to build a thematic index of a scientific book. This book is ...
Since its foundation in May 2009, the médialab Sciences Po works to foster the use of digital method...
The Web and its tools of orientation: how better to understand the information available on the Inte...
The use of hypertext links on the web makes sites more attractive and easier to read and allows enri...
International audienceGiven the large heterogeneity of the World Wide Web, using metadata on the sea...
ISBN 2-906855-18-9Web resources are more and more different, not only regarding thematic content but...
Web resources are more and more different, not only regarding thematic content but also related to t...
Web resources are more and more different, not only regarding thematic content but also related to t...
In this thesis, which is part and parcel of the more general context of web information retrieval, w...
Dans cette thèse, qui s'inscrit dans le contexte général de la recherche d'information sur la Toile,...
Web resources are more and more different, not only regarding thematic content but also related to t...
MultilingualWeb Document (MWD) processing has become one of the major interests of research and deve...
International audienceThe authors, who publish knowledge on the Web related to readable electronic d...
International audienceThe authors who publish knowledge on the Web in readable electronic documents ...
The Web is a huge source of information, and one of the main problems facing users is finding docume...
We describe a real experiment in order to build a thematic index of a scientific book. This book is ...
Since its foundation in May 2009, the médialab Sciences Po works to foster the use of digital method...
The Web and its tools of orientation: how better to understand the information available on the Inte...
The use of hypertext links on the web makes sites more attractive and easier to read and allows enri...
International audienceGiven the large heterogeneity of the World Wide Web, using metadata on the sea...