We consider transfer learning approaches that fine-tune a pretrained deep neural network on a target task. We study the generalization properties of fine-tuning to understand the problem of overfitting, which commonly arises in practice. Previous works have shown that constraining the distance from the initialization of fine-tuning improves generalization. Using a PAC-Bayesian analysis, we observe that, besides the distance from initialization, the Hessian of the loss also affects generalization through the noise stability of deep neural networks against noise injections. Motivated by this observation, we develop Hessian distance-based generalization bounds for a wide range of fine-tuning methods. Additionally, we study the robustness of fine-tuning in the presence of no...
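As a concrete illustration of the idea of constraining the distance from initialization during fine-tuning, the following is a minimal sketch in PyTorch. It is not the paper's exact method: the regularization coefficient `lambda_reg`, the optimizer, and the training hyperparameters are illustrative placeholders, and the model and data loader are assumed to be supplied by the caller.

```python
import torch

def finetune_with_init_penalty(model, loader, loss_fn, lambda_reg=0.01, lr=1e-4, epochs=3):
    """Fine-tune `model` while penalizing the L2 distance of its weights
    from the pretrained initialization (a sketch, not a specific paper's method)."""
    # Snapshot the pretrained parameters to measure the distance from initialization.
    init_params = [p.detach().clone() for p in model.parameters()]
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)

    for _ in range(epochs):
        for inputs, targets in loader:
            optimizer.zero_grad()
            loss = loss_fn(model(inputs), targets)
            # Squared L2 distance from initialization, summed over all parameters.
            dist = sum(((p - p0) ** 2).sum()
                       for p, p0 in zip(model.parameters(), init_params))
            (loss + lambda_reg * dist).backward()
            optimizer.step()
    return model
```

The connection to noise stability referenced above can be seen from a second-order Taylor expansion: under isotropic Gaussian perturbations of the weights, the expected increase in loss is governed (to second order) by the trace of the loss Hessian, which is why Hessian-dependent quantities enter the bounds alongside the distance from initialization.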
In recent years Deep Neural Networks (DNNs) have achieved state-of-the-art results in many fields su...
Recent research in robust optimization has shown an overfitting-like phenomenon in which models trai...
The understanding of generalization in machine learning is in a state of flux. This is partly due to...
Deep learning has transformed computer vision, natural language processing, and speech recognition. ...
Deep neural networks (DNNs...
While there has been progress in developing non-vacuous generalization bounds for deep neural networ...
Existing generalization bounds fail to explain crucial factors that drive generalization of modern n...
In the last decade or so, deep learning has revolutionized entire domains of machine learning. Neura...
A main puzzle of deep networks revolves around the absence of overfitting despite overparametrizatio...
The classical statistical learning theory implies that fitting too many parameters leads to overfitt...
The general features of the optimization problem for the case of overparametrized nonlinear networks...
While significant theoretical progress has been achieved, unveiling the generalization mystery of ov...
Increasing the size of overparameterized neural networks has been shown to improve their generalizat...
Understanding generalization is crucial to confidently engineer and deploy machine learning models, ...
This paper provides theoretical insights into why and how deep learning can generalize well, despite...