GPUs are the workhorse in modern server infrastructure fueling advances in a number of compute-intensive workloads such as deep neural network (DNN) training. Several recent works propose solutions on sharing GPU resources across multiple concurrent DNN training jobs, but none of them address rapidly increasing memory footprint introduced by such job co-locations, which greatly limit the effectiveness of sharing GPU resources. In this paper, we present Zico, the first DNN system that aims at reducing the system-wide memory consumption for concurrent training. Zico keeps track of the memory usage pattern of individual training job by monitoring its progress on GPU computations and makes memory reclaimed from the job globally sharable. Based ...
With widespread advances in machine learning, a number of large enterprises are beginning to incorpo...
© 2021 by The USENIX Association.Deep neural networks (DNNs) are widely used in various AI applicati...
One of the reasons behind the tremendous success of deep learning theory and applications in the rec...
The most widely used machine learning frameworks require users to carefully tune their memory usage ...
Memory usage is becoming an increasingly pressing bottleneck in the training process of Deep Neural ...
Deep learning has been widely adopted for different applications of artificial intelligence-speech r...
© 2018 ACM. Going deeper and wider in neural architectures improves their accuracy, while the limite...
Deep Learning, specifically Deep Neural Networks (DNNs), is stressing storage systems in new...
Our work seeks to improve and adapt computing systems and machine learning (ML) algorithms to match ...
Popular deep learning frameworks require users to fine-tune their memory usage so that the training ...
International audienceThe resource-hungry and time-consuming process of training Deep Neural Network...
Deep neural networks (DNNs) have grown exponentially in size over the past decade, leaving only thos...
Deep learning-based solutions and, in particular, deep neural networks (DNNs) are at the heart of se...
Deep neural networks (DNNs) have emerged as successful solutions for variety of artificial intellige...
Machine learning (ML) is now omnipresent in all spheres of life. The use of deep neural networks (DN...
With widespread advances in machine learning, a number of large enterprises are beginning to incorpo...
© 2021 by The USENIX Association.Deep neural networks (DNNs) are widely used in various AI applicati...
One of the reasons behind the tremendous success of deep learning theory and applications in the rec...
The most widely used machine learning frameworks require users to carefully tune their memory usage ...
Memory usage is becoming an increasingly pressing bottleneck in the training process of Deep Neural ...
Deep learning has been widely adopted for different applications of artificial intelligence-speech r...
© 2018 ACM. Going deeper and wider in neural architectures improves their accuracy, while the limite...
Deep Learning, specifically Deep Neural Networks (DNNs), is stressing storage systems in new...
Our work seeks to improve and adapt computing systems and machine learning (ML) algorithms to match ...
Popular deep learning frameworks require users to fine-tune their memory usage so that the training ...
International audienceThe resource-hungry and time-consuming process of training Deep Neural Network...
Deep neural networks (DNNs) have grown exponentially in size over the past decade, leaving only thos...
Deep learning-based solutions and, in particular, deep neural networks (DNNs) are at the heart of se...
Deep neural networks (DNNs) have emerged as successful solutions for variety of artificial intellige...
Machine learning (ML) is now omnipresent in all spheres of life. The use of deep neural networks (DN...
With widespread advances in machine learning, a number of large enterprises are beginning to incorpo...
© 2021 by The USENIX Association.Deep neural networks (DNNs) are widely used in various AI applicati...
One of the reasons behind the tremendous success of deep learning theory and applications in the rec...