Meta-Learning with Adjoint Methods

Li, Shibo
Wang, Zheng
Narayan, Akil
Kirby, Robert
Zhe, Shandian

Publication date

October 2021

Language

English

Abstract

Model Agnostic Meta-Learning (MAML) is widely used to find a good initialization for a family of tasks. Despite its success, a critical challenge in MAML is to calculate the gradient w.r.t the initialization of a long training trajectory for the sampled tasks, because the computation graph can rapidly explode and the computational cost is very expensive. To address this problem, we propose Adjoint MAML (A-MAML). We view gradient descent in the inner optimization as the evolution of an Ordinary Differential Equation (ODE). To efficiently compute the gradient of the validation loss w.r.t the initialization, we use the adjoint method to construct a companion, backward ODE. To obtain the gradient w.r.t the initialization, we only need to run th...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Meta-Learning with Adjoint Methods

Abstract

Extracted data

Meta-Learning with Adjoint Methods

Abstract

Extracted data

Related items

Related items