Deployment of a distributed deep learning technology stack on a large parallel system is a very complex process, involving the integration and configuration of several layers of both, general-purpose and custom software. The details of such kind of deployments are rarely described in the literature. This paper presents the experiences observed during the deployment of a technology stack to enable deep learning workloads on MareNostrum, a petascale supercomputer. The components of a layered architecture, based on the usage of Apache Spark, are described and the performance and scalability of the resulting system is evaluated. This is followed by a discussion about the impact of different configurations including parallelism, storage and netw...
Deep Learning has achieved outstanding results in many fields and led to groundbreaking discoveries....
Deep learning algorithms base their success on building high learning capacity models with millions ...
Deep neural networks are trained by solving huge optimization problems with large datasets and milli...
Deployment of a distributed deep learning technology stack on a large parallel system is a very comp...
Neural networks are becoming more and more popular in scientific field and in the industry. It is mo...
In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a peta...
peer reviewedWith renewed global interest for Artificial Intelligence (AI) methods, the past decade ...
The rapid growth of data and ever increasing model complexity of deep neural networks (DNNs) have en...
Deep learning has been postulated as a solution for numerous problems in different branches of scien...
The field of deep learning has been the focus of plenty of research and development over the last y...
Deep learning has been postulated as a solution for numerous problems in different branches of scien...
Continuously increasing data volumes from multiple sources, such as simulation and experimental meas...
Abstract—In this paper we present a framework to enable data-intensive Spark workloads on MareNostru...
Machine learning has established itself as a powerful tool for the construction of decision making m...
Deep Neural Networks (DNNs) enable computers to excel across many different applications such as ima...
Deep Learning has achieved outstanding results in many fields and led to groundbreaking discoveries....
Deep learning algorithms base their success on building high learning capacity models with millions ...
Deep neural networks are trained by solving huge optimization problems with large datasets and milli...
Deployment of a distributed deep learning technology stack on a large parallel system is a very comp...
Neural networks are becoming more and more popular in scientific field and in the industry. It is mo...
In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a peta...
peer reviewedWith renewed global interest for Artificial Intelligence (AI) methods, the past decade ...
The rapid growth of data and ever increasing model complexity of deep neural networks (DNNs) have en...
Deep learning has been postulated as a solution for numerous problems in different branches of scien...
The field of deep learning has been the focus of plenty of research and development over the last y...
Deep learning has been postulated as a solution for numerous problems in different branches of scien...
Continuously increasing data volumes from multiple sources, such as simulation and experimental meas...
Abstract—In this paper we present a framework to enable data-intensive Spark workloads on MareNostru...
Machine learning has established itself as a powerful tool for the construction of decision making m...
Deep Neural Networks (DNNs) enable computers to excel across many different applications such as ima...
Deep Learning has achieved outstanding results in many fields and led to groundbreaking discoveries....
Deep learning algorithms base their success on building high learning capacity models with millions ...
Deep neural networks are trained by solving huge optimization problems with large datasets and milli...