Machine learning has been widely used in various application domains such as recommendation, computer vision, natural language processing, etc. The procedure that uses a trained model to perform prediction on unseen data, namely, deep learning inference is widely accelerated by dedicated SIMD/SIMT accelerators, such as GPUs. The existing deep learning inference frameworks utilize dedicated accelerators in a continuously host-instruct-device fashion. Specifically, for each operator in a computation graph, the deep learning framework runtime launches computation kernels from the host system, delegating computation task to a dedicated device accelerator. However, these deep learning frameworks’ runtime often introduces significant execution o...
We present a library that provides optimized implementations for deep learning primitives. Deep lear...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...
This thesis presents a few methods to accelerate the inference of Deep Neural Networks that are lar...
Deep learning algorithms have seen success in a wide variety of applications, such as machine transl...
Deep neural networks (DNNs) are becoming a key enabling technique for many application domains. Howe...
Deep learning (DL) has been widely adopted those last years but they are computing-intensive method....
Deep learning has demonstrated high accuracy and efficiency in various applications. For example, Co...
The latest Deep Learning (DL) methods for designing Deep Neural Networks (DNN) have significantly ex...
In recent years, deep neural networks (DNNs) have revolutionized the field of machine learning. DNNs...
Machine Learning (ML) frameworks are tools that facilitate the development and deployment of ML mode...
Recent advances in Deep Learning (DL) research have been adopted in a wide variety of applications, ...
In recent years, the accuracy of Deep Neural Networks (DNNs) has improved significantly because of t...
Our work seeks to improve and adapt computing systems and machine learning (ML) algorithms to match ...
After a decade of accelerated progress in the different areas of machine learning (ML), it has becom...
We present a library that provides optimized implementations for deep learning primitives. Deep lear...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...
This thesis presents a few methods to accelerate the inference of Deep Neural Networks that are lar...
Deep learning algorithms have seen success in a wide variety of applications, such as machine transl...
Deep neural networks (DNNs) are becoming a key enabling technique for many application domains. Howe...
Deep learning (DL) has been widely adopted those last years but they are computing-intensive method....
Deep learning has demonstrated high accuracy and efficiency in various applications. For example, Co...
The latest Deep Learning (DL) methods for designing Deep Neural Networks (DNN) have significantly ex...
In recent years, deep neural networks (DNNs) have revolutionized the field of machine learning. DNNs...
Machine Learning (ML) frameworks are tools that facilitate the development and deployment of ML mode...
Recent advances in Deep Learning (DL) research have been adopted in a wide variety of applications, ...
In recent years, the accuracy of Deep Neural Networks (DNNs) has improved significantly because of t...
Our work seeks to improve and adapt computing systems and machine learning (ML) algorithms to match ...
After a decade of accelerated progress in the different areas of machine learning (ML), it has becom...
We present a library that provides optimized implementations for deep learning primitives. Deep lear...
We devise a performance model for GPU training of Deep Learning Recommendation Models (DLRM), whose ...
With the rapid growth of deep learning models and higher expectations for their accuracy and through...