This paper identifies four common misconceptions about the scalability of volunteer computing on big data problems. The misconceptions are then clarified by analyzing the relationship between scalability and the impact factors including the problem size of big data, the heterogeneity and dynamics of volunteers, and the overlay structure. This paper proposes optimization strategies to find the optimal overlay for the given big data problem. This paper forms multiple overlays to optimize the performance of individual steps in terms of MapReduce paradigm. The optimization is to achieve the maximum overall performance by using a minimum number of volunteers, not overusing resources. This paper has demonstrated that the simulations on the concer...
Formulate performance optimization problems of big data workflows with various objects; prove comple...
In the first part of this chapter we illustrate how a big data project can be set up and optimized. ...
This accompanying document for deliverable “D4.4 Resource Optimization Methods and Algorithms” descr...
The discussion context of this paper is big data processing of MapReduce by volunteer computing in d...
A major challenge of using volunteer computing (VC) for big data problems is the opportunistic envir...
It is little to find off-the-shelf research results in the current literature about how competent Vo...
The poor scalability of Volunteer Computing (VC) hinders the application of it because a tremendous ...
Guo, W ORCiD: 0000-0002-3134-3327Volunteer Computing (VC) has been successfully applied to many comp...
The availability of big data sets in research, industry and society in general has opened up many po...
The main objective of this book is to provide the necessary background to work with big data by intr...
The emergence of Big Data applications provides new challenges in data management such as processing...
The recent explosion in size and complexity of datasets and the increased availability of computatio...
Abstract—The emergence of Big Data applications provides new challenges in data management such as p...
Big data processing has recently gained a lot of attention both from academia and industry. The term...
Abstract. Many scientific projects use BOINC middleware to build a volunteer computing project. BOIN...
Formulate performance optimization problems of big data workflows with various objects; prove comple...
In the first part of this chapter we illustrate how a big data project can be set up and optimized. ...
This accompanying document for deliverable “D4.4 Resource Optimization Methods and Algorithms” descr...
The discussion context of this paper is big data processing of MapReduce by volunteer computing in d...
A major challenge of using volunteer computing (VC) for big data problems is the opportunistic envir...
It is little to find off-the-shelf research results in the current literature about how competent Vo...
The poor scalability of Volunteer Computing (VC) hinders the application of it because a tremendous ...
Guo, W ORCiD: 0000-0002-3134-3327Volunteer Computing (VC) has been successfully applied to many comp...
The availability of big data sets in research, industry and society in general has opened up many po...
The main objective of this book is to provide the necessary background to work with big data by intr...
The emergence of Big Data applications provides new challenges in data management such as processing...
The recent explosion in size and complexity of datasets and the increased availability of computatio...
Abstract—The emergence of Big Data applications provides new challenges in data management such as p...
Big data processing has recently gained a lot of attention both from academia and industry. The term...
Abstract. Many scientific projects use BOINC middleware to build a volunteer computing project. BOIN...
Formulate performance optimization problems of big data workflows with various objects; prove comple...
In the first part of this chapter we illustrate how a big data project can be set up and optimized. ...
This accompanying document for deliverable “D4.4 Resource Optimization Methods and Algorithms” descr...