Time-travel text search enriches standard text search by temporal predicates, so that users of web archives can easily retrieve document versions that are considered relevant to a given keyword query and existed during a given time interval. Different index structures have been proposed to efficiently support time-travel text search. None of them, however, can easily be updated as the Web evolves and new document versions are added to the web archive. In this work, we describe a novel index structure that efficiently supports time-travel text search and can be maintained incrementally as new document versions are added to the web archive. Our solution uses a sharded index organization, bounds the number of spuriously read index entries per sh...
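To make the query semantics above concrete, here is a minimal sketch of a time-travel keyword query over an in-memory inverted index whose postings carry validity intervals. It is a plain, unsharded baseline and not the index structure described in the abstract; the names `Posting`, `TimeTravelIndex`, `add_version`, and `query`, as well as the integer timestamps and the half-open interval convention, are assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Posting:
    """One document version (hypothetical layout, not taken from the paper)."""
    doc_id: str
    begin: int  # first time point at which the version existed (inclusive)
    end: int    # time point at which the version was superseded (exclusive)


class TimeTravelIndex:
    """Naive inverted index: term -> list of postings with validity intervals."""

    def __init__(self):
        self._postings = defaultdict(list)

    def add_version(self, doc_id, text, begin, end):
        # Appending postings for a new version is all that an update requires here.
        for term in set(text.lower().split()):
            self._postings[term].append(Posting(doc_id, begin, end))

    def query(self, keywords, t_begin, t_end):
        """Versions that contain every keyword and existed at some point in [t_begin, t_end]."""
        result = None
        for term in keywords:
            hits = {
                p for p in self._postings.get(term.lower(), [])
                # Interval overlap test between [p.begin, p.end) and [t_begin, t_end].
                if p.begin <= t_end and t_begin < p.end
            }
            result = hits if result is None else result & hits
        return sorted(result or [], key=lambda p: (p.doc_id, p.begin))


index = TimeTravelIndex()
index.add_version("d1", "web archive search", 0, 10)
index.add_version("d1", "web archive index", 10, 20)
index.add_version("d2", "web search engine", 5, 15)
matches = index.query(["web", "search"], 12, 18)
```

In this baseline every posting of a term is scanned and then filtered by the overlap test, so entries outside the query interval are read only to be discarded. Those are the spuriously read entries that the abstract says the sharded organization bounds.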
An increasing number of temporally versioned text collections is available today with Web archives...
In the text retrieval community, many researchers have shown good quality in searching a current sna...
We consider in this paper the information retrieval problem over a collection of time-evolving docu...
text-indexing techniques do not provide efficient support for time-travel queries. Further, the high...
Text search over temporally versioned document collections such as web archives has received little ...
There have been numerous efforts recently to digitize previously published content and preserve bo...
Time-travel queries that couple temporal constraints with keyword queries are useful in searching la...
The availability of versioned text collections such as the Internet Archive opens up opportunities...
Web archives include both archives of contents originally published on the Web (e.g., the Internet A...
In this work we develop a system for letting users search versioned documents – i.e., collections c...
Modern text analytics applications operate on large volumes of temporal text data such as Web arch...
In temporal document databases and temporal XML databases, temporal text-containment queries are a p...
An increasing number of documents in companies and other organizations are now only available electr...
A number of emerging large-scale applications such as web archiving and time-stamped web objects ge...