Over the last years, the ever-growing number of Machine Learning(ML) and Artificial Intelligence(AI) applications deployed in the Cloud has led to high demands on the computing resources required for efficient processing. Multiple users deploy multiple applications on the same server node to maximize Quality of Service(QoS); however, this leads to increased interference. In addition, Cloud providers aim to minimize their operating costs by efficiently utilizing the available resources. These conflicting optimization goals form a complex paradigm where efficient scheduling is required. In this work, we present IRIS, an interference- and resource-aware predictive inference scheduling framework for ML inference serving in the cloud. We targe...
Cloud-based solutions are increasingly being used to implement large-scale dynamic data driven appli...
Over the last number of years there has been high acceleration and widespread adoption of cloud base...
The operational cost of a cloud computing platform is one of the most significant Quality of Service...
A plethora of applications are using machine learning, the operations of which are becoming more com...
As an increasing number of businesses becomes powered by machine-learning, inference becomes a core ...
The rise of distributed cloud computing technologies has been pivotal for the large-scale adoption o...
computing delivers quality of service (QoS) to the end-user. Next, it discusses how to schedule one’...
Better resource utilization is a continuous demand for smart computing paradigm. Fog-to-Cloud (F2C) ...
National audienceThe emergence of Machine Learning (ML) has increased exponentially in numerous appl...
With the advent of ubiquitous deployment of smart devices and the Internet of Things, data sources f...
© 2018 IEEE.As cloud data centers are dramatically growing, various applications are moved to cloud ...
Abstract Automated resource provisioning techniques enable the implementation of elastic services, b...
Traditional resource management techniques that rely on simple heuristics often fail to achieve pred...
TensorFlow, a popular machine learning (ML) platform, allows users to transparently exploit both GPU...
The success of cloud computing builds largely upon on-demand supply of virtual machines (VMs) that p...
Cloud-based solutions are increasingly being used to implement large-scale dynamic data driven appli...
Over the last number of years there has been high acceleration and widespread adoption of cloud base...
The operational cost of a cloud computing platform is one of the most significant Quality of Service...
A plethora of applications are using machine learning, the operations of which are becoming more com...
As an increasing number of businesses becomes powered by machine-learning, inference becomes a core ...
The rise of distributed cloud computing technologies has been pivotal for the large-scale adoption o...
computing delivers quality of service (QoS) to the end-user. Next, it discusses how to schedule one’...
Better resource utilization is a continuous demand for smart computing paradigm. Fog-to-Cloud (F2C) ...
National audienceThe emergence of Machine Learning (ML) has increased exponentially in numerous appl...
With the advent of ubiquitous deployment of smart devices and the Internet of Things, data sources f...
© 2018 IEEE.As cloud data centers are dramatically growing, various applications are moved to cloud ...
Abstract Automated resource provisioning techniques enable the implementation of elastic services, b...
Traditional resource management techniques that rely on simple heuristics often fail to achieve pred...
TensorFlow, a popular machine learning (ML) platform, allows users to transparently exploit both GPU...
The success of cloud computing builds largely upon on-demand supply of virtual machines (VMs) that p...
Cloud-based solutions are increasingly being used to implement large-scale dynamic data driven appli...
Over the last number of years there has been high acceleration and widespread adoption of cloud base...
The operational cost of a cloud computing platform is one of the most significant Quality of Service...