With the emergence of various big data platforms in recent years, Apache Spark - a distributed large-scale computing platform, is perceived as a potential substitute for Message Passing Interface (MPI) in High Performance Computing (HPC). Due to the limitations in fault-tolerance, dynamic resource handling and ease of use, MPI, as a dominant method to achieve parallel computing in HPC, is often associated with higher development time and costs in enterprises such as Scania IT. This thesis project aims to examine Apache Spark as an alternative to MPI on HPC clusters and compare their performance in various aspects. The test results are obtained by running a compute- intensive application on both platforms to solve a Bayesian inference proble...
AbstractOne of the biggest challenges of the current big data landscape is our inability to pro- ces...
In the era of Big Data, machine learning has taken on a whole new role. With the amount of data pres...
Project Specification The goal of this openlab summer student project is to analyse Apache Spark as...
With the emergence of various big data platforms in recent years, Apache Spark - a distributed large...
As dataset sizes increase, data analysis tasks in high performance computing (HPC) are increasingly ...
Due to the latest development in the context of Internet of Things, the amount of generated and coll...
International audienceBig Data analytics frameworks (e.g., Apache Hadoop and Apache Spark) have been...
Este trabalho compara o desempenho e a estabilidade de dois arcabouços para o processamento de Big D...
In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a peta...
One of the biggest challenges of the current big data landscape is our inability to process vast amo...
Abstract—In this paper we present a framework to enable data-intensive Spark workloads on MareNostru...
The digital era's requirements pose many challenges related to deployment, implementation and effici...
Big Data applications allow to successfully analyze large amounts of data not necessarily structured...
We explore the trade-offs of performing linear algebra using Apache Spark, compared to traditional C...
Spark has been established as an attractive platform for big data analysis, since it manages to hide...
AbstractOne of the biggest challenges of the current big data landscape is our inability to pro- ces...
In the era of Big Data, machine learning has taken on a whole new role. With the amount of data pres...
Project Specification The goal of this openlab summer student project is to analyse Apache Spark as...
With the emergence of various big data platforms in recent years, Apache Spark - a distributed large...
As dataset sizes increase, data analysis tasks in high performance computing (HPC) are increasingly ...
Due to the latest development in the context of Internet of Things, the amount of generated and coll...
International audienceBig Data analytics frameworks (e.g., Apache Hadoop and Apache Spark) have been...
Este trabalho compara o desempenho e a estabilidade de dois arcabouços para o processamento de Big D...
In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a peta...
One of the biggest challenges of the current big data landscape is our inability to process vast amo...
Abstract—In this paper we present a framework to enable data-intensive Spark workloads on MareNostru...
The digital era's requirements pose many challenges related to deployment, implementation and effici...
Big Data applications allow to successfully analyze large amounts of data not necessarily structured...
We explore the trade-offs of performing linear algebra using Apache Spark, compared to traditional C...
Spark has been established as an attractive platform for big data analysis, since it manages to hide...
AbstractOne of the biggest challenges of the current big data landscape is our inability to pro- ces...
In the era of Big Data, machine learning has taken on a whole new role. With the amount of data pres...
Project Specification The goal of this openlab summer student project is to analyse Apache Spark as...