Running faster will only get you so far — it is generally advisable to first understand where the roads lead, then get a car ... The renaissance of machine learning (ML) and deep learning (DL) over the last decade is accompanied by an unscalable computational cost, limiting its advancement and weighing on the field in practice. In this thesis we take a systematic approach to address the algorithmic and methodological limitations at the root of these costs. We first demonstrate that DL training and pruning are predictable and governed by scaling laws — for state of the art models and tasks, spanning image classification and language modeling, as well as for state of the art model compression via iterative pruning. Predictability, via the ...
While machine learning is traditionally a resource intensive task, embedded systems, autonomous navi...
Les algorithmes d'apprentissage profond forment un nouvel ensemble de méthodes puissantes pour l'ap...
We present an extensive evaluation of a wide variety of promising design patterns for automated deep...
Machine learning (ML), a computational self-learning platform, is expected to be applied in a variet...
Neural scaling laws define a predictable relationship between a model's parameter count and its perf...
One long-term goal of machine learning research is to produce methods that are applicable to highly ...
Deep learning's recent history has been one of achievement: from triumphing over humans in the game ...
Deep learning algorithms are responsible for a technological revolution in a variety oftasks includi...
We study the compute-optimal trade-off between model and training data set sizes for large neural ne...
The remarkable practical success of deep learning has revealed some major surprises from a theoretic...
Deep learning has been empirically successful in recent years thanks to the extremely over-parameter...
We analyze the growth of dataset sizes used in machine learning for natural language processing and ...
Deep learning training consumes ever-increasing time and resources, and that isdue to the complexity...
Increased use of data and computation have been the main drivers in Deep Learning for improving perf...
It took until the last decade to finally see a machine match human performance on essentially any ta...
While machine learning is traditionally a resource intensive task, embedded systems, autonomous navi...
Les algorithmes d'apprentissage profond forment un nouvel ensemble de méthodes puissantes pour l'ap...
We present an extensive evaluation of a wide variety of promising design patterns for automated deep...
Machine learning (ML), a computational self-learning platform, is expected to be applied in a variet...
Neural scaling laws define a predictable relationship between a model's parameter count and its perf...
One long-term goal of machine learning research is to produce methods that are applicable to highly ...
Deep learning's recent history has been one of achievement: from triumphing over humans in the game ...
Deep learning algorithms are responsible for a technological revolution in a variety oftasks includi...
We study the compute-optimal trade-off between model and training data set sizes for large neural ne...
The remarkable practical success of deep learning has revealed some major surprises from a theoretic...
Deep learning has been empirically successful in recent years thanks to the extremely over-parameter...
We analyze the growth of dataset sizes used in machine learning for natural language processing and ...
Deep learning training consumes ever-increasing time and resources, and that isdue to the complexity...
Increased use of data and computation have been the main drivers in Deep Learning for improving perf...
It took until the last decade to finally see a machine match human performance on essentially any ta...
While machine learning is traditionally a resource intensive task, embedded systems, autonomous navi...
Les algorithmes d'apprentissage profond forment un nouvel ensemble de méthodes puissantes pour l'ap...
We present an extensive evaluation of a wide variety of promising design patterns for automated deep...