International audienceThis paper proposes a model for specifying data flow-based parallel data processing programs agnostic of target Big Data processing frameworks. The paper focuses on the formal abstract specification of non-iterative and iterative programs, generalizing the strategies adopted by data flow Big Data processing frameworks. The proposed model relies on Monoid Algebra and Petri Nets to abstract Big Data processing programs in two levels: a higher level representing the program data flow and a lower level representing data transformation operations (e.g., filtering, aggregation, join). We extend the model for data processing programs, for modeling iterative data processing programs. The general specification of these progr...
Big-data is the expression used to describe large data sets, which are complex and require analysis ...
Big data processing is becoming a reality in numerous real-world applications. With the emergence of...
In order to reduce the initial investments needed by small and medium enterprises (SMEs) to acquire ...
This is an extended version of Modeling Big Data Processing Programs, by Joao Batista de Souza Neto,...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
The volume, variety, and velocity properties of big data and the valuable information it contains ha...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
With Cloud Computing emerging as a promising new approach for ad-hoc parallel data processing, major...
In the world of Big Data analytics, there is a series of tools aiming at simplifying programming app...
In the age of Big Data, scalable algorithm implementations as well as powerful computational resourc...
Big data analysis imposes new challenges and requirements on programming support. Programming platfo...
Over the past years, frameworks such as MapReduce and Spark have been introduced to ease the task of...
Big Data does not only refer to a huge amount of diverse and heterogeneous data. It also points to t...
To run proper Big Data Analytics, small and medium enterprises (SMEs) need to acquire expertise, har...
Big data processing is no longer restricted to specially-trained engineers. Instead, domain experts,...
Big-data is the expression used to describe large data sets, which are complex and require analysis ...
Big data processing is becoming a reality in numerous real-world applications. With the emergence of...
In order to reduce the initial investments needed by small and medium enterprises (SMEs) to acquire ...
This is an extended version of Modeling Big Data Processing Programs, by Joao Batista de Souza Neto,...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
The volume, variety, and velocity properties of big data and the valuable information it contains ha...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
With Cloud Computing emerging as a promising new approach for ad-hoc parallel data processing, major...
In the world of Big Data analytics, there is a series of tools aiming at simplifying programming app...
In the age of Big Data, scalable algorithm implementations as well as powerful computational resourc...
Big data analysis imposes new challenges and requirements on programming support. Programming platfo...
Over the past years, frameworks such as MapReduce and Spark have been introduced to ease the task of...
Big Data does not only refer to a huge amount of diverse and heterogeneous data. It also points to t...
To run proper Big Data Analytics, small and medium enterprises (SMEs) need to acquire expertise, har...
Big data processing is no longer restricted to specially-trained engineers. Instead, domain experts,...
Big-data is the expression used to describe large data sets, which are complex and require analysis ...
Big data processing is becoming a reality in numerous real-world applications. With the emergence of...
In order to reduce the initial investments needed by small and medium enterprises (SMEs) to acquire ...