Abstract — Hybrid systems for analyzing big data integrate an analytic tool and a dedicated data-management platform. The necessary movement of data between the components of a hybrid system can lead to performance problems, if that movement is not managed effectively. We present Agrios, a hybrid analytic system for array-structured data, integrating R and SciDB. Agrios minimizes data movement between the two components of the hybrid, using techniques repurposed from relational database query optimization. Keywords—big-array analytics; query optimization; R; SciDB; I
Most data scientists use nowadays functional or semi-functional languages like SQL, Scala or R to tr...
Hybrid data analysis systems integrate an analytic tool and a data management tool. While hybrid sys...
Computing complex statistics on large amounts of data is no longer a corner case, but a daily challe...
<p>Statistical analysis of massive array data is becoming indispensable in answering important scien...
Big array analytics is becoming indispensable in answering impor-tant scientific and business questi...
International audienceHybrid complex analytics workloads typically include (i) data management tasks...
textabstractNon-trivial scientific applications often involve complex computations on large multi-di...
Modern industrial, government, and academic organizations are collecting massive amounts of data (“B...
Multi-dimensional arrays (also known as raster data or gridded data) play a key role in many, if not...
textabstractComputing complex statistics on large amounts of data is no longer a corner case, but a ...
<p>Modern industrial, government, and academic organizations are collecting massive amounts of data ...
In order to uncover insights and trends, it is an increasingly common practice for companies of all ...
Scientific applications are generating an ever-increasing volume of multi-dimensional data that are ...
© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Recent decades have seen an explosion in the diversity and scale of data analytics tasks. While data...
Most data scientists use nowadays functional or semi-functional languages like SQL, Scala or R to tr...
Hybrid data analysis systems integrate an analytic tool and a data management tool. While hybrid sys...
Computing complex statistics on large amounts of data is no longer a corner case, but a daily challe...
<p>Statistical analysis of massive array data is becoming indispensable in answering important scien...
Big array analytics is becoming indispensable in answering impor-tant scientific and business questi...
International audienceHybrid complex analytics workloads typically include (i) data management tasks...
textabstractNon-trivial scientific applications often involve complex computations on large multi-di...
Modern industrial, government, and academic organizations are collecting massive amounts of data (“B...
Multi-dimensional arrays (also known as raster data or gridded data) play a key role in many, if not...
textabstractComputing complex statistics on large amounts of data is no longer a corner case, but a ...
<p>Modern industrial, government, and academic organizations are collecting massive amounts of data ...
In order to uncover insights and trends, it is an increasingly common practice for companies of all ...
Scientific applications are generating an ever-increasing volume of multi-dimensional data that are ...
© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Recent decades have seen an explosion in the diversity and scale of data analytics tasks. While data...
Most data scientists use nowadays functional or semi-functional languages like SQL, Scala or R to tr...
Hybrid data analysis systems integrate an analytic tool and a data management tool. While hybrid sys...
Computing complex statistics on large amounts of data is no longer a corner case, but a daily challe...