In this dissertation, we address the emerging demand for extending traditional relational support to semi-structured data and for real-time data analytics. Semi-structured data (e.g., JSON) poses new challenges: large numbers of attributes, sparse attributes, and dynamic changes in both workload and dataset. In this context, we design, implement, and evaluate a novel technique for vertically partitioning data to optimize the performance of main memory databases. Our partitioning algorithm enables JSON data storage in main memory relational databases by intelligently decomposing JSON objects into different tables. It also adapts to changes in workload and dataset by dynamically refining the current layout. Using the NoBench dataset for JS...
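The idea of decomposing JSON objects into different tables can be illustrated with a minimal sketch. It is not the dissertation's algorithm: it assumes a simple frequency rule (a hypothetical `dense_threshold` parameter) in which attributes present in most objects go to one wide table and rarely present attributes go to a narrow (id, attribute, value) table. The function names and the sample documents are illustrative assumptions only.

```python
from collections import Counter


def partition_attributes(objects, dense_threshold=0.5):
    """Split attribute names into dense and sparse lists by occurrence ratio.

    Assumption: an attribute is "dense" if it appears in at least
    `dense_threshold` of the objects. This rule is illustrative only.
    """
    counts = Counter(key for obj in objects for key in obj)
    n = len(objects)
    dense = sorted(k for k, c in counts.items() if c / n >= dense_threshold)
    sparse = sorted(k for k, c in counts.items() if c / n < dense_threshold)
    return dense, sparse


def decompose(objects, dense_threshold=0.5):
    """Decompose flat JSON-like dicts into a wide table and a sparse table."""
    dense, sparse = partition_attributes(objects, dense_threshold)
    # Wide table: one row per object, one column per dense attribute.
    dense_table = [
        {"id": i, **{k: obj.get(k) for k in dense}}
        for i, obj in enumerate(objects)
    ]
    # Sparse table: one (id, attribute, value) row per sparse value present.
    sparse_table = [
        (i, k, obj[k])
        for i, obj in enumerate(objects)
        for k in sparse
        if k in obj
    ]
    return dense_table, sparse_table


# Hypothetical sample documents with two common and two rare attributes.
docs = [
    {"name": "a", "num": 1, "sparse_001": "x"},
    {"name": "b", "num": 2},
    {"name": "c", "num": 3, "sparse_042": "y"},
]
dense_table, sparse_table = decompose(docs)
```

A workload-aware partitioner would also weigh which attributes are queried together; this sketch considers only how often each attribute occurs in the data.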
One of the challenging tasks for database administrators is tuning database systems within a short p...
Big data analytics often involves complex join queries over two or more tables. Such join process...
A semantic web database is an RDF database. Semantic web data has increased tremendously, as r...
Recently there have been increasing demands for real-time analytics on rapidly changing data and for...
Developers often prefer flexibility over upfront schema design, making semi-structured data formats ...
NoSQL database management systems adopt semi-structured data models, such as JSON, to easily accommo...
JavaScript Object Notation was originally designed to transfer data; however, it soon found another ...
Semi-structured data, like JSON, XML, and their derivatives, are essential in modern computing infra...
With the increasing demands of transforming raw data into information and knowledge, data mining be...
Analytical workloads in data warehouses often include heavy joins where queries involve multiple fac...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
JSON is a popular data format which is very flexible since no schema needs to be defined and therefo...
Abstract. Vertical partitioning is a well-known technique to improve query response time in relation...
In the last two decades, relational databases for analytics have been specialized to address the nee...
With the explosion of data, large datasets become more common for data analysis. However, existing...