We investigate the complexity of evaluating queries in Relational Algebra (RA) over the relations extracted by regex formulas (i.e., regular expressions with capture variables) over text documents. Such queries, also known as the regular document spanners, were shown to have an evaluation with polynomial delay for every positive RA expression (i.e., consisting of only natural joins, projections and unions); here, the RA expression is fixed and the input consists of both the regex formulas and the document. In this work, we explore the implication of two fundamental generalizations. The first is adopting the “schemaless” semantics for spanners, as proposed and studied by Maturana et al. The second is going beyond the positive RA to allowing ...
The present paper investigates the dynamic complexity of document spanners, a formal framework for i...
We consider the information extraction framework known as document spanners, and study the problem o...
Most modern implementations of regular expression engines allow the use of variables (also called ba...
A document spanner models a program for Information Extraction (IE) as a function that takes as inpu...
Regular expressions with capture variables, also known as regex-formulas,extract relations of spans ...
We examine document spanners, a formal framework for information extraction that was introduced by F...
We examine document spanners, a formal framework for information extraction that was introduced by F...
Document spanners are a formal framework for information extraction that was introduced by [Fagin, K...
This paper investigates regex CQs with string equalities (SERCQs), a subclass of core spanners. As s...
Document spanners are a formal framework for information extraction that was introduced by Fagin, Ki...
Regular expressions and automata models with capture variables are core tools in rule-based informat...
Regular expressions and automata models with capture variables are core tools in rule-based informat...
Regular Expressions (REs) are ubiquitous in database and programming languages. While many applicati...
An intrinsic part of information extraction is the creation and ma-nipulation of relations extracted...
The present paper investigates the dynamic complexity of document spanners, a formal framework for i...
The present paper investigates the dynamic complexity of document spanners, a formal framework for i...
We consider the information extraction framework known as document spanners, and study the problem o...
Most modern implementations of regular expression engines allow the use of variables (also called ba...
A document spanner models a program for Information Extraction (IE) as a function that takes as inpu...
Regular expressions with capture variables, also known as regex-formulas,extract relations of spans ...
We examine document spanners, a formal framework for information extraction that was introduced by F...
We examine document spanners, a formal framework for information extraction that was introduced by F...
Document spanners are a formal framework for information extraction that was introduced by [Fagin, K...
This paper investigates regex CQs with string equalities (SERCQs), a subclass of core spanners. As s...
Document spanners are a formal framework for information extraction that was introduced by Fagin, Ki...
Regular expressions and automata models with capture variables are core tools in rule-based informat...
Regular expressions and automata models with capture variables are core tools in rule-based informat...
Regular Expressions (REs) are ubiquitous in database and programming languages. While many applicati...
An intrinsic part of information extraction is the creation and ma-nipulation of relations extracted...
The present paper investigates the dynamic complexity of document spanners, a formal framework for i...
The present paper investigates the dynamic complexity of document spanners, a formal framework for i...
We consider the information extraction framework known as document spanners, and study the problem o...
Most modern implementations of regular expression engines allow the use of variables (also called ba...