Deep Learning (DL) models have achieved superior performance. Meanwhile, computing hardware like NVIDIA GPUs also demonstrated strong computing scaling trends with 2x throughput and memory bandwidth for each generation. With such strong computing scaling of GPUs, multi-tenant deep learning inference by co-locating multiple DL models onto the same GPU becomes widely deployed to improve resource utilization, enhance serving throughput, reduce energy cost, etc. However, achieving efficient multi-tenant DL inference is challenging which requires thorough full-stack system optimization. This survey aims to summarize and categorize the emerging challenges and optimization opportunities for multi-tenant DL inference on GPU. By overviewing the enti...
Presented at DATE Friday Workshop on System-level Design Methods for Deep Learning on Heterogeneous ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...
Computer vision tasks such as image classification have prevalent use and are greatly aided by the d...
The rise of deep-learning (DL) has been fuelled by the improvements in accelerators. Due to its uniq...
Our work seeks to improve and adapt computing systems and machine learning (ML) algorithms to match ...
Deep learning is an emerging workload in the field of HPC. This powerful method of resolution is abl...
Data analysts predict that the GPU as a Service (GPUaaS) market will grow from US$700 million in 201...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
Training deep learning (DL) models is a highly compute-intensive task since it involves operating on...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Deep learning models are trained on servers with many GPUs, andtraining must scale with the number o...
Deep learning (DL) has been widely adopted those last years but they are computing-intensive method....
Recent advances in hardware, such as systems with multiple GPUs and their availability in the cloud,...
Deep learning (DL) training jobs now constitute a large portion of the jobs in the GPU clusters. Fol...
The invention of deep belief network (DBN) provides a powerful tool for data modeling. The key advan...
Presented at DATE Friday Workshop on System-level Design Methods for Deep Learning on Heterogeneous ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...
Computer vision tasks such as image classification have prevalent use and are greatly aided by the d...
The rise of deep-learning (DL) has been fuelled by the improvements in accelerators. Due to its uniq...
Our work seeks to improve and adapt computing systems and machine learning (ML) algorithms to match ...
Deep learning is an emerging workload in the field of HPC. This powerful method of resolution is abl...
Data analysts predict that the GPU as a Service (GPUaaS) market will grow from US$700 million in 201...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
Training deep learning (DL) models is a highly compute-intensive task since it involves operating on...
The aim of this project is to conduct a study of deep learning on multi-core processors. The study i...
Deep learning models are trained on servers with many GPUs, andtraining must scale with the number o...
Deep learning (DL) has been widely adopted those last years but they are computing-intensive method....
Recent advances in hardware, such as systems with multiple GPUs and their availability in the cloud,...
Deep learning (DL) training jobs now constitute a large portion of the jobs in the GPU clusters. Fol...
The invention of deep belief network (DBN) provides a powerful tool for data modeling. The key advan...
Presented at DATE Friday Workshop on System-level Design Methods for Deep Learning on Heterogeneous ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...
Computer vision tasks such as image classification have prevalent use and are greatly aided by the d...