XML has become a widely used and well structured data format for digital document handling and message transmission. To find useful knowledge in XML data, data warehouse and OLAP applications aimed at providing supports for decision making should be developed. Apache Hadoop is an open source cloud computing framework that provides a distributed file system for large scale data processing. In this paper, we discuss an XML data cube model which offers us the complete views to observe XML data, and present a basic algorithm to implement its building process on Hadoop. To improve the efficiency, an optimized algorithm more suitable for this kind of XML data is also proposed. The experimental results given in the paper prove the effectiveness o...
Abstract. MapReduce/Hadoop has gained acceptance as a framework to process, transform, integrate, an...
This paper approach the importance of XML for organizing and managing better the data based on texts...
*Corresponding authors Abstract: Data warehousing and Online Analytical Processing (OLAP) technologi...
XML has become a widely used and well structured data format for digital document handling and messa...
Ambient systems generate large volumes of data for many of their application areas with XML often th...
The current paper shows an end-to-end approach how to process XML files in the Hadoop ecosystem. The...
AbstractAmbient systems generate large volumes of data for many of their application areas with XML ...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
AbstractIn prior work it has been shown that the design of scientific workflows can benefit from a c...
Semi-structured information is often represented in the XML format. Although, a vast amount of appro...
Integrating XML documents in data warehouse is a major issue for decisional data processing and busi...
Data warehousing is an important application of database technology. Even though XML is ubiquitous a...
With the development of computer technologies, the amount of data has explosive growth and the data ...
International audienceThe data warehousing and OLAP technologies are now moving onto handling comple...
In prior work it has been shown that the design of scientific workflows can benefit from a collectio...
Abstract. MapReduce/Hadoop has gained acceptance as a framework to process, transform, integrate, an...
This paper approach the importance of XML for organizing and managing better the data based on texts...
*Corresponding authors Abstract: Data warehousing and Online Analytical Processing (OLAP) technologi...
XML has become a widely used and well structured data format for digital document handling and messa...
Ambient systems generate large volumes of data for many of their application areas with XML often th...
The current paper shows an end-to-end approach how to process XML files in the Hadoop ecosystem. The...
AbstractAmbient systems generate large volumes of data for many of their application areas with XML ...
Abstract—An emerging trend is the use of XML as the data format for many distributed scientific appl...
AbstractIn prior work it has been shown that the design of scientific workflows can benefit from a c...
Semi-structured information is often represented in the XML format. Although, a vast amount of appro...
Integrating XML documents in data warehouse is a major issue for decisional data processing and busi...
Data warehousing is an important application of database technology. Even though XML is ubiquitous a...
With the development of computer technologies, the amount of data has explosive growth and the data ...
International audienceThe data warehousing and OLAP technologies are now moving onto handling comple...
In prior work it has been shown that the design of scientific workflows can benefit from a collectio...
Abstract. MapReduce/Hadoop has gained acceptance as a framework to process, transform, integrate, an...
This paper approach the importance of XML for organizing and managing better the data based on texts...
*Corresponding authors Abstract: Data warehousing and Online Analytical Processing (OLAP) technologi...