While computational modelling gets more complex and more accurate, its calculation costs have been increasing alike. However, working on big data environments usually involves several steps of massive unfiltered data transmission. In this paper, we continue our work on the PArADISE framework, which enables privacy aware distributed computation of big data scenarios, and present a study on how linear algebra operations can be calculated over parallel relational database systems using SQL. We investigate the ways to improve the computation performance of algebra operations over relational databases and show how using database techniques impacts the computation performance like the use of indexes, choice of schema, query formulation and others...
doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidde...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
To better support decision making, it was proposed to extend SQL to include data cube operations. Co...
While computational modelling gets more complex and more accurate, its calculation costs have been i...
Linear algebra operations appear in nearly every application in advanced analytics, machine learning...
Machine Learning is a research field with substantial relevance for many applications in different a...
In the big data era, the use of large-scale machine learning methods is becoming ubiquitous in data ...
Aggregations help computing summaries of a data set, which are ubiquitous in various big data analyt...
Data summarization is an essential mechanism to accelerate analytic algorithms on large data sets. I...
This dissertation advances the state of the art for scalable high-performance graph analytics and da...
International audienceIn cryptanalysis, solving the discrete logarithm problem (DLP) is key to asses...
In this thesis, I consider the problem of making linear algebra simple to use and efficient to run i...
Scientific computations and analytical business applications are often based on linear algebra opera...
A notable characteristic of the scientific computing and machine learning prob-lem domains is the la...
Many data analysis programs are often expressed in terms of array operations in sequential loops. Ho...
doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidde...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
To better support decision making, it was proposed to extend SQL to include data cube operations. Co...
While computational modelling gets more complex and more accurate, its calculation costs have been i...
Linear algebra operations appear in nearly every application in advanced analytics, machine learning...
Machine Learning is a research field with substantial relevance for many applications in different a...
In the big data era, the use of large-scale machine learning methods is becoming ubiquitous in data ...
Aggregations help computing summaries of a data set, which are ubiquitous in various big data analyt...
Data summarization is an essential mechanism to accelerate analytic algorithms on large data sets. I...
This dissertation advances the state of the art for scalable high-performance graph analytics and da...
International audienceIn cryptanalysis, solving the discrete logarithm problem (DLP) is key to asses...
In this thesis, I consider the problem of making linear algebra simple to use and efficient to run i...
Scientific computations and analytical business applications are often based on linear algebra opera...
A notable characteristic of the scientific computing and machine learning prob-lem domains is the la...
Many data analysis programs are often expressed in terms of array operations in sequential loops. Ho...
doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidde...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
To better support decision making, it was proposed to extend SQL to include data cube operations. Co...