Communication with XML often involves pre-agreed document types. In this paper, we propose an offline parser generation approach to enhance online processing performance for documents con-forming to a given DTD. Our examination of DTDs and the languages they define demonstrates the existence of ambiguities. We present an algorithm that maps DTDs to deterministic context-free grammars defining the same languages. We prove the grammars to be LL(1) and LALR(1), making them suitable for standard parser generators. Our experiments show the superior performance of generated optimized parsers. Our results generalize from DTDs to XML Schema specifications with certain restrictions, most notably the absence of namespaces, which exceed the scope of c...
Stührenberg M, Wurm C. Refining the Taxonomy of XML Schema Languages. A new Approach for Categorizin...
For languages with significant inflectional morphology, development of a morpho-logical parser is of...
Parsing is an expensive operation that can degrade XML processing performance. A survey of four repr...
XML is a standard and universal language for rep- resenting information. XML processing is supported...
This paper presents TDX, a table-driven XML parser. TDX combines parsing and validation into one pas...
The growing synergy between Web Services and Gridbased technologies is enabling profound, dynamic in...
XML delivers key advantages in interoperability due to its flexibility, expressiveness, and platform...
Abstract. This paper describes how to use conventional compiler construction tools, and parser gener...
This paper deals with the expression power and decidability of XML schema lan-guages. In the first s...
This paper examines how documents marked up using the Text Encoding Initiative (TEI) SGML guidelines...
The Standard Generalized Markup Language (SGML) and the Extensible Markup Language (XML) allow autho...
We consider the problem to infer a concise Document Type Definition (DTD) for a given set of XML-doc...
The use of schemas makes an XML-based application more reliable, since they contribute to avoid fail...
The common abstraction of XML Schema by unranked regular tree languages is not entirely accurate. To...
XML (eXtensible Markup Language), a linear syntax for trees, has gathered a remarkable amount of int...
Stührenberg M, Wurm C. Refining the Taxonomy of XML Schema Languages. A new Approach for Categorizin...
For languages with significant inflectional morphology, development of a morpho-logical parser is of...
Parsing is an expensive operation that can degrade XML processing performance. A survey of four repr...
XML is a standard and universal language for rep- resenting information. XML processing is supported...
This paper presents TDX, a table-driven XML parser. TDX combines parsing and validation into one pas...
The growing synergy between Web Services and Gridbased technologies is enabling profound, dynamic in...
XML delivers key advantages in interoperability due to its flexibility, expressiveness, and platform...
Abstract. This paper describes how to use conventional compiler construction tools, and parser gener...
This paper deals with the expression power and decidability of XML schema lan-guages. In the first s...
This paper examines how documents marked up using the Text Encoding Initiative (TEI) SGML guidelines...
The Standard Generalized Markup Language (SGML) and the Extensible Markup Language (XML) allow autho...
We consider the problem to infer a concise Document Type Definition (DTD) for a given set of XML-doc...
The use of schemas makes an XML-based application more reliable, since they contribute to avoid fail...
The common abstraction of XML Schema by unranked regular tree languages is not entirely accurate. To...
XML (eXtensible Markup Language), a linear syntax for trees, has gathered a remarkable amount of int...
Stührenberg M, Wurm C. Refining the Taxonomy of XML Schema Languages. A new Approach for Categorizin...
For languages with significant inflectional morphology, development of a morpho-logical parser is of...
Parsing is an expensive operation that can degrade XML processing performance. A survey of four repr...