Semi-structured data, like JSON, XML, and their derivatives, are essential in modern computing infrastructures, from cloud computing and microservice to NoSQL data stores and Internet of Things (IoT). However, existing software often fails to process such types of data in a scalable way due to their nested structures and the lack of effective data-level parallelism. The goal of this thesis is mainly to address two fundamental scalability issues in semi-structured data processing. First, how can semi-structured data analytics effectively leverage the abundant hardware parallelism that are offered by modern computer architectures (e.g., multi-cores and SIMD operations)? Second, how can semi-structured data analytics improve the data access e...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
A fast response is critical in many data-intensive applications, including knowledge discovery analy...
Large volumes of data are produced, published and exchanged over the Internet. Such data is often in...
Semi-structured data, like JSON, XML, and their derivatives, are essential in modern computing infra...
In online social networking, network monitoring and finan-cial applications, there is a need to quer...
JavaScript Object Notation ( JSON) and its variants have gained great popularity in recent years. Un...
This thesis will address the problems of indexing XML datasets and finding effective searching metho...
International audienceSemi-structured data sets in the form of XML documents have many practical use...
This thesis presents methods for eciently evaluating structural queries over tree-structured data st...
This report is part of our ongoing project on the optimization of stream processing for XPath querie...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.XML and the XPath queryi...
Abstract. XML has been widely adopted across a wide spectrum of applica-tions. Its parsing efficienc...
Numerous applications in for example science, engineering, and financial analysis increasingly requi...
In this dissertation, we address the emerging demand for extending traditional relational support to...
Abstract—Data streaming has become an important paradigm for the real-time processing of continuous ...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
A fast response is critical in many data-intensive applications, including knowledge discovery analy...
Large volumes of data are produced, published and exchanged over the Internet. Such data is often in...
Semi-structured data, like JSON, XML, and their derivatives, are essential in modern computing infra...
In online social networking, network monitoring and finan-cial applications, there is a need to quer...
JavaScript Object Notation ( JSON) and its variants have gained great popularity in recent years. Un...
This thesis will address the problems of indexing XML datasets and finding effective searching metho...
International audienceSemi-structured data sets in the form of XML documents have many practical use...
This thesis presents methods for eciently evaluating structural queries over tree-structured data st...
This report is part of our ongoing project on the optimization of stream processing for XPath querie...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.XML and the XPath queryi...
Abstract. XML has been widely adopted across a wide spectrum of applica-tions. Its parsing efficienc...
Numerous applications in for example science, engineering, and financial analysis increasingly requi...
In this dissertation, we address the emerging demand for extending traditional relational support to...
Abstract—Data streaming has become an important paradigm for the real-time processing of continuous ...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
A fast response is critical in many data-intensive applications, including knowledge discovery analy...
Large volumes of data are produced, published and exchanged over the Internet. Such data is often in...