We describe a toolkit for highly scalable XML data processing, consisting of two components. The first is a collection of stand-alone XML tools, s.a. sort- ing, aggregation, nesting, and unnesting, that can be chained to express more complex restructurings. The second is a highly scalable XPath processor for XML streams that can be used to develop scalable solutions for XML stream applications. In this paper we dis- cuss the tools, and some of the techniques we used to achieve high scalability. The toolkit is freely available as an open-source project
The Extensible Markup Language (XML) has become a widely adopted data interchange format. With the r...
Abstract. We propose a new implementation scheme for XML transformation languages through derivation...
In prior work it has been shown that the design of scientific workflows can benefit from a collectio...
XMLTK: An XML Toolkit for Scalable XML Stream Processing We describe a toolkit for highly scalable X...
In online social networking, network monitoring and finan-cial applications, there is a need to quer...
AbstractData Streaming is a necessary and useful technique to process very large XML documentsbut po...
The development of high-throughput genome sequencing and protein structure determination techniques ...
Because XML (extensible markup language) is self-described, there is much redundant structural infor...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
Abstract: Each element and value in a XML stream can be accessed only one time. In this paper, effi...
This paper describes a query processing engine for multiple XML streams, in which series of correlat...
XML has become a standard for document storage and interchange and its convenient syntax improves th...
Publish-subscribe systems present the state of the art in information dissemination to multiple user...
Systems for selective dissemination of information (SDI) are used to efficiently filter, transform, ...
with FLWOR power and functional updates Abstract—The size of XML trees that can be processed by an X...
The Extensible Markup Language (XML) has become a widely adopted data interchange format. With the r...
Abstract. We propose a new implementation scheme for XML transformation languages through derivation...
In prior work it has been shown that the design of scientific workflows can benefit from a collectio...
XMLTK: An XML Toolkit for Scalable XML Stream Processing We describe a toolkit for highly scalable X...
In online social networking, network monitoring and finan-cial applications, there is a need to quer...
AbstractData Streaming is a necessary and useful technique to process very large XML documentsbut po...
The development of high-throughput genome sequencing and protein structure determination techniques ...
Because XML (extensible markup language) is self-described, there is much redundant structural infor...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
Abstract: Each element and value in a XML stream can be accessed only one time. In this paper, effi...
This paper describes a query processing engine for multiple XML streams, in which series of correlat...
XML has become a standard for document storage and interchange and its convenient syntax improves th...
Publish-subscribe systems present the state of the art in information dissemination to multiple user...
Systems for selective dissemination of information (SDI) are used to efficiently filter, transform, ...
with FLWOR power and functional updates Abstract—The size of XML trees that can be processed by an X...
The Extensible Markup Language (XML) has become a widely adopted data interchange format. With the r...
Abstract. We propose a new implementation scheme for XML transformation languages through derivation...
In prior work it has been shown that the design of scientific workflows can benefit from a collectio...