Tese de doutoramento, Informática (Engenharia Informática), Universidade de Lisboa, Faculdade de Ciências, 2014Web archives preserve information that was published on the web or digitized from printed publications. Many of that information is unique and historically valuable. However, users do not have dedicated tools to find the desired information, which hampers the usefulness of web archives. This dissertation investigates solutions towards the advance of web archive information retrieval (WAIR) and contributes to the increase of knowledge about its technology and users. The thesis underlying this work is that the search results can be improved by exploiting temporal information intrinsic to web archives. This temporal information was le...
This paper introduces the Portuguese Web Archive initiative, presenting its main objectives and wor...
With the growing importance of the World Wide Web, the major challenges our society faces are also i...
A number of emerging large scale applications such as web archiving and time-stamped web objects ge...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
The Web has become the main publication medium world-wide, covering almost every facet of human acti...
In a text retrieval community, many researchers have shown a good quality of searching a current sna...
An important amount of the world s cultural and intellectual knowledge is being created on the webev...
The web became a mass means of publication that has been replacing printed media. However, its infor...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
Every day, unique valuable information that describes our current days disappears from the web. Nati...
International audienceSince late 90s, there has been a large investment in web archiving. Accessing ...
There have been numerous efforts recently to digitize previously published content and preserving bo...
Every day, unique valuable information that describes our current days disappears from the web. Nati...
Web archives constitute valuable sources for researchers in various disciplines. However, their shee...
Web archival materials are not direct traces of the web, they are direct traces of crawlers. By desi...
This paper introduces the Portuguese Web Archive initiative, presenting its main objectives and wor...
With the growing importance of the World Wide Web, the major challenges our society faces are also i...
A number of emerging large scale applications such as web archiving and time-stamped web objects ge...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
The Web has become the main publication medium world-wide, covering almost every facet of human acti...
In a text retrieval community, many researchers have shown a good quality of searching a current sna...
An important amount of the world s cultural and intellectual knowledge is being created on the webev...
The web became a mass means of publication that has been replacing printed media. However, its infor...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
Every day, unique valuable information that describes our current days disappears from the web. Nati...
International audienceSince late 90s, there has been a large investment in web archiving. Accessing ...
There have been numerous efforts recently to digitize previously published content and preserving bo...
Every day, unique valuable information that describes our current days disappears from the web. Nati...
Web archives constitute valuable sources for researchers in various disciplines. However, their shee...
Web archival materials are not direct traces of the web, they are direct traces of crawlers. By desi...
This paper introduces the Portuguese Web Archive initiative, presenting its main objectives and wor...
With the growing importance of the World Wide Web, the major challenges our society faces are also i...
A number of emerging large scale applications such as web archiving and time-stamped web objects ge...