Differentially Private (DP) learning has seen limited success for building large deep learning models of text, and attempts at straightforwardly applying Differentially Private Stochastic Gradient Descent (DP-SGD) to NLP tasks have resulted in large performance drops and high computational overhead. We show that this performance drop can be mitigated with (1) the use of large pretrained models; (2) hyperparameters that suit DP optimization; and (3) fine-tuning objectives aligned with the pretraining procedure. With these factors set right, we obtain private NLP models that outperform state-of-the-art private training approaches and strong non-private baselines -- by directly fine-tuning pretrained models with DP optimization on moderately-s...
Differentially private stochastic gradient descent (DP-SGD) has been widely adopted in deep learning...
The pre-training and fine-tuning paradigm has contributed to a number of breakthroughs in Natural La...
While modern machine learning models rely on increasingly large training datasets, data is often lim...
We give simpler, sparser, and faster algorithms for differentially private fine-tuning of large-scal...
As the use of large embedding models in recommendation systems and language applications increases, ...
Preserving privacy in contemporary NLP models allows us to work with sensitive data, but unfortunate...
We study the problem of differentially private (DP) fine-tuning of large pre-trained models -- a rec...
Protecting large language models from privacy leakage is becoming increasingly crucial with their wi...
Differentially Private methods for training Deep Neural Networks (DNNs) have progressed recently, in...
Federated Learning (FL) is a technique to train models using data distributed across devices. Differ...
A well-known algorithm in privacy-preserving ML is differentially private stochastic gradient descen...
Pre-training large transformer models with in-domain data improves domain adaptation and helps gain ...
Training large neural networks with meaningful/usable differential privacy security guarantees is a ...
Per-example gradient clipping is a key algorithmic step that enables practical differential private ...
Existing approaches for training neural networks with user-level differential privacy (e.g., DP Fede...
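Several of the abstracts above hinge on the same core mechanism: DP-SGD clips each example's gradient to a fixed L2 norm and adds calibrated Gaussian noise before applying the averaged update. Below is a minimal sketch of that step in PyTorch, kept deliberately generic; the toy linear model (standing in for a trainable head on a pretrained encoder), the batch size, the clipping bound, and the noise multiplier are illustrative assumptions, not settings taken from any of the cited papers, and a real setup would additionally track the cumulative (epsilon, delta) budget with a privacy accountant.

```python
# Minimal DP-SGD step sketch: per-example gradient clipping + Gaussian noise.
# All hyperparameter values here are placeholders for illustration only.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Stand-in for a trainable head on top of a (frozen) pretrained encoder;
# any nn.Module with trainable parameters works the same way.
model = nn.Linear(16, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

max_grad_norm = 1.0      # per-example clipping bound C (assumed value)
noise_multiplier = 1.0   # sigma; with the sampling rate and step count it determines (epsilon, delta)

def dp_sgd_step(x_batch, y_batch):
    """One DP-SGD step: clip each example's gradient, sum, add noise, average, apply."""
    params = [p for p in model.parameters() if p.requires_grad]
    summed_grads = [torch.zeros_like(p) for p in params]

    for x, y in zip(x_batch, y_batch):
        loss = loss_fn(model(x.unsqueeze(0)), y.unsqueeze(0))
        grads = torch.autograd.grad(loss, params)

        # Clip this example's gradient to L2 norm at most max_grad_norm.
        total_norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
        clip_coef = (max_grad_norm / (total_norm + 1e-6)).clamp(max=1.0)
        for s, g in zip(summed_grads, grads):
            s.add_(g * clip_coef)

    # Add Gaussian noise scaled to the clipping bound, then average over the batch.
    batch_size = len(x_batch)
    for p, s in zip(params, summed_grads):
        noise = torch.randn_like(s) * noise_multiplier * max_grad_norm
        p.grad = (s + noise) / batch_size

    optimizer.step()

# Usage on a random toy batch.
x = torch.randn(8, 16)
y = torch.randint(0, 2, (8,))
dp_sgd_step(x, y)
```

The per-example loop above is the straightforward (and slow) formulation; the works cited above on computational overhead are largely about replacing this loop with vectorized or ghost-clipping style per-example gradient computation while keeping the same clipping-plus-noise semantics.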