Data analytics in the cloud has become an integral part of enterprise businesses. Big data analytics systems, however, still lack the ability to take user performance goals and budgetary constraints for a task, collectively referred to as task objectives, and automatically configure an analytic job to achieve these objectives. This paper presents a data analytics optimizer that can automatically determine a cluster configuration with a suitable number of cores as well as other system parameters that best meet the task objectives. At the core of our work is a principled multi-objective optimization (MOO) approach that computes a Pareto optimal set of job configurations to reveal tradeoffs between different user objectives...
Big data analytics systems today still lack the ability to take user performan...
The last years have seen a steep rise in data generation worldwide, with the development and widespr...
Cloud data analytics has become an integral part of enterprise business operations for data-driven in...
The distributed data analytics system Spark is a common choice for processing massive volumes of h...
Big Data Optimization is the term used to refer to optimization problems which have to manage very l...
In-memory cluster computing platforms have gained momentum in the last years, due to their ability t...
During the last 10 years, the volume of global data has risen more than tenfol...
Distributed dataflow systems enable data-parallel processing of large datasets on clusters. Public c...