Web archives include both archives of contents originally published on the Web (e.g., the Internet Archive) but also archives of contents published long ago that are now accessible on the Web (e.g., the archive of The Times). Thanks to the increased awareness that web-born contents are worth preserving and to improved digitization techniques, web archives have grown in number and size. To unfold their full potential, search techniques are needed that consider their inherent special characteristics. This work addresses three important problems toward this objective and makes the following contributions: * We present the Time-Travel Inverted indeX (TTIX) as an efficient solution to time-travel text search in web archives, allowing users to se...
A number of emerging large scale applications such as web archiving and time-stamped web objects ge...
Time-travel queries that couple temporal constraints with keyword queries are useful in searching la...
Getting an overview of a historic entity or event can be difficult in search results, especially if ...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
The Web has become the main publication medium world-wide, covering almost every facet of human acti...
There have been numerous efforts recently to digitize previously published content and preserving bo...
Time-travel text search enriches standard text search by temporal predicates, so that users of web a...
In a text retrieval community, many researchers have shown a good quality of searching a current sna...
Text search over temporally versioned document collections such as web archives has received little ...
The availability of versioned text collections such as the Internet Archive opens up opportunities f...
text-indexing techniques do not provide efficient support for time-travel queries. Further, the high...
International audienceSince late 90s, there has been a large investment in web archiving. Accessing ...
Tese de doutoramento, Informática (Engenharia Informática), Universidade de Lisboa, Faculdade de Ciê...
We consider in this paper the information retrieval problem over a collection of time-evolving docu...
A number of emerging large scale applications such as web archiving and time-stamped web objects ge...
Time-travel queries that couple temporal constraints with keyword queries are useful in searching la...
Getting an overview of a historic entity or event can be difficult in search results, especially if ...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
The Web has become the main publication medium world-wide, covering almost every facet of human acti...
There have been numerous efforts recently to digitize previously published content and preserving bo...
Time-travel text search enriches standard text search by temporal predicates, so that users of web a...
In a text retrieval community, many researchers have shown a good quality of searching a current sna...
Text search over temporally versioned document collections such as web archives has received little ...
The availability of versioned text collections such as the Internet Archive opens up opportunities f...
text-indexing techniques do not provide efficient support for time-travel queries. Further, the high...
International audienceSince late 90s, there has been a large investment in web archiving. Accessing ...
Tese de doutoramento, Informática (Engenharia Informática), Universidade de Lisboa, Faculdade de Ciê...
We consider in this paper the information retrieval problem over a collection of time-evolving docu...
A number of emerging large scale applications such as web archiving and time-stamped web objects ge...
Time-travel queries that couple temporal constraints with keyword queries are useful in searching la...
Getting an overview of a historic entity or event can be difficult in search results, especially if ...