The growth in size and computational requirements in training Neural Networks (NN) over the past few years has led to an increase in their sizes. In many cases, the networks can grow so large that can no longer fit on a single machine. A model parallel approach, backed by partitioning of Neural Networks and placement of operators on devices in a distributed system, provides a better distributed solution to this problem. In this thesis, we motivate the case for device placement in Neural Networks. We propose, analyze and evaluate mSCT, a polynomial time algorithmic solution to this end. Additionally, we formulate an exponential time optimal ILP solution that models the placement problem. We summarize our contributions as: 1. We propose a theo...
Fast response, storage efficiency, fault tolerance and graceful degradation in face of scarce or spu...
The aim of this work is to describe a possible approach for the optimization of the job scheduling i...
The work is about fundamental parallel machine scheduling problems which occur in manufacturing syst...
The growth in size and computational requirements in training Neural Networks (NN) over the past few...
Many companies, organizations and/or universities have accumulated a large number of computing reso...
We consider a class of problems of scheduling independent jobs on identical, uniform and unrelated p...
Features such as fast response, storage efficiency, fault tolerance and graceful degradation in face...
Modern neural network (NN) models require more data and parameters in or- der to perform ever more c...
Large scale machine learning requires tradeoffs. Commonly this tradeoff has led practitioners to cho...
This paper introduces a resource allocation framework specifically tailored for addressing the probl...
During the last years, the amount of data which can be represented and processed as graph structured...
Recent years have witnessed a rapid growth of distributed machine learning (ML) frameworks, which ex...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Thanks to the rise of machine learning (ML) and its vast applications, recent years have witnessed a...
Abstract—this paper studies the influence that task placement may have on the performance of applica...
Fast response, storage efficiency, fault tolerance and graceful degradation in face of scarce or spu...
The aim of this work is to describe a possible approach for the optimization of the job scheduling i...
The work is about fundamental parallel machine scheduling problems which occur in manufacturing syst...
The growth in size and computational requirements in training Neural Networks (NN) over the past few...
Many companies, organizations and/or universities have accumulated a large number of computing reso...
We consider a class of problems of scheduling independent jobs on identical, uniform and unrelated p...
Features such as fast response, storage efficiency, fault tolerance and graceful degradation in face...
Modern neural network (NN) models require more data and parameters in or- der to perform ever more c...
Large scale machine learning requires tradeoffs. Commonly this tradeoff has led practitioners to cho...
This paper introduces a resource allocation framework specifically tailored for addressing the probl...
During the last years, the amount of data which can be represented and processed as graph structured...
Recent years have witnessed a rapid growth of distributed machine learning (ML) frameworks, which ex...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Thanks to the rise of machine learning (ML) and its vast applications, recent years have witnessed a...
Abstract—this paper studies the influence that task placement may have on the performance of applica...
Fast response, storage efficiency, fault tolerance and graceful degradation in face of scarce or spu...
The aim of this work is to describe a possible approach for the optimization of the job scheduling i...
The work is about fundamental parallel machine scheduling problems which occur in manufacturing syst...