We present a novel approach for parallelizing the execution of queries over XML documents, implemented within our system PAXQuery. We compile a rich subset of XQuery into plans expressed in the PArallelization ConTracts (PACT) programming model. These plans are then optimized and executed in parallel by the Stratosphere system. We demonstrate the efficiency and scalability of our approach through experiments on hundreds of GB of XML data.nonnonrechercheInternationa
ABSTRACT: Distributed Query Processing is an efficient processing method for large XML data by parti...
While for languages like XPath there are many processing techniques that use very little memory, the...
Relational XQuery systems try to re-use mature relational data management infrastructures to create ...
International audienceWe present a novel approach for parallelizing the execution of queries over XM...
International audienceXQuery is a general-purpose programming language for processing semi-structure...
National audienceIncreasing volumes of data are being produced and exchanged over the Web, in partic...
Increasing use of XML has emphasized the need for scalable database systems that are capable of hand...
Abstract — A language for semi-structured documents, XML has emerged as the core of the web services...
In online social networking, network monitoring and finan-cial applications, there is a need to quer...
Very large scientific datasets are becoming increasingly available in XML formats. Our earlier bench...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.XML and the XPath queryi...
In contrast to relational databases the distribution of document-centric XML is not well researched....
Abstract. In contrast to relational databases the distribution of document-centric XML is not well r...
In this study, we present experiences of parallelizing XPath queries using the Xalan XPath engine on...
ABSTRACT: Distributed Query Processing is an efficient processing method for large XML data by parti...
While for languages like XPath there are many processing techniques that use very little memory, the...
Relational XQuery systems try to re-use mature relational data management infrastructures to create ...
International audienceWe present a novel approach for parallelizing the execution of queries over XM...
International audienceXQuery is a general-purpose programming language for processing semi-structure...
National audienceIncreasing volumes of data are being produced and exchanged over the Web, in partic...
Increasing use of XML has emphasized the need for scalable database systems that are capable of hand...
Abstract — A language for semi-structured documents, XML has emerged as the core of the web services...
In online social networking, network monitoring and finan-cial applications, there is a need to quer...
Very large scientific datasets are becoming increasingly available in XML formats. Our earlier bench...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.XML and the XPath queryi...
In contrast to relational databases the distribution of document-centric XML is not well researched....
Abstract. In contrast to relational databases the distribution of document-centric XML is not well r...
In this study, we present experiences of parallelizing XPath queries using the Xalan XPath engine on...
ABSTRACT: Distributed Query Processing is an efficient processing method for large XML data by parti...
While for languages like XPath there are many processing techniques that use very little memory, the...
Relational XQuery systems try to re-use mature relational data management infrastructures to create ...