Model Agnostic Meta-Learning (MAML) is widely used to find a good initialization for a family of tasks. Despite its success, a critical challenge in MAML is to calculate the gradient w.r.t the initialization of a long training trajectory for the sampled tasks, because the computation graph can rapidly explode and the computational cost is very expensive. To address this problem, we propose Adjoint MAML (A-MAML). We view gradient descent in the inner optimization as the evolution of an Ordinary Differential Equation (ODE). To efficiently compute the gradient of the validation loss w.r.t the initialization, we use the adjoint method to construct a companion, backward ODE. To obtain the gradient w.r.t the initialization, we only need to run th...
Humans have a remarkable ability to learn new concepts from only a few examples and quickly adapt to...
Model-agnostic meta-learning (MAML) is arguably one of the most popular meta-learning algorithms now...
Despite huge progress in artificial intelligence, the ability to quickly learn from few examples is ...
Model-agnostic meta-learning (MAML) is a meta-learning technique to train a model on a multitude of ...
We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter opt...
Optimization-based meta-learning aims to learn an initialization so that a new unseen task can be le...
Meta-learning algorithms leverage regularities that are present on a set of tasks to speed up and im...
Machine learning algorithms and systems are progressively becoming part of our societies, leading to...
Meta-learning owns unique effectiveness and swiftness in tackling emerging tasks with limited data. ...
The aim of Few-Shot learning methods is to train models which can easily adapt to previously unseen ...
Online learning methods, similar to the online gradient algorithm (OGA) and exponentially weighted a...
Recent developments in few-shot learning have shown that during fast adaption, gradient-based meta-l...
Finding neural network weights that generalize well from small datasets is difficult. A promising ap...
International audienceThis paper addresses a cornerstone of Automated Machine Learning: the problem ...
Deep learning has achieved classification performance matching or exceeding the human one, as long a...
Humans have a remarkable ability to learn new concepts from only a few examples and quickly adapt to...
Model-agnostic meta-learning (MAML) is arguably one of the most popular meta-learning algorithms now...
Despite huge progress in artificial intelligence, the ability to quickly learn from few examples is ...
Model-agnostic meta-learning (MAML) is a meta-learning technique to train a model on a multitude of ...
We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter opt...
Optimization-based meta-learning aims to learn an initialization so that a new unseen task can be le...
Meta-learning algorithms leverage regularities that are present on a set of tasks to speed up and im...
Machine learning algorithms and systems are progressively becoming part of our societies, leading to...
Meta-learning owns unique effectiveness and swiftness in tackling emerging tasks with limited data. ...
The aim of Few-Shot learning methods is to train models which can easily adapt to previously unseen ...
Online learning methods, similar to the online gradient algorithm (OGA) and exponentially weighted a...
Recent developments in few-shot learning have shown that during fast adaption, gradient-based meta-l...
Finding neural network weights that generalize well from small datasets is difficult. A promising ap...
International audienceThis paper addresses a cornerstone of Automated Machine Learning: the problem ...
Deep learning has achieved classification performance matching or exceeding the human one, as long a...
Humans have a remarkable ability to learn new concepts from only a few examples and quickly adapt to...
Model-agnostic meta-learning (MAML) is arguably one of the most popular meta-learning algorithms now...
Despite huge progress in artificial intelligence, the ability to quickly learn from few examples is ...