As the use of large embedding models in recommendation systems and language applications increases, concerns over user data privacy have also risen. DP-SGD, a training algorithm that combines differential privacy with stochastic gradient descent, has been the workhorse for protecting user privacy without significantly compromising model accuracy. However, applying DP-SGD naively to embedding models can destroy gradient sparsity, leading to reduced training efficiency. To address this issue, we present two new algorithms, DP-FEST and DP-AdaFEST, that preserve gradient sparsity during private training of large embedding models. Our algorithms achieve substantial reductions ($10^6 \times$) in gradient size, while maintaining comparable levels of accu...
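The abstract above refers to DP-SGD as the standard private training algorithm. As background, its core per-step recipe is: clip each example's gradient to a fixed norm, average, and add calibrated Gaussian noise. A minimal NumPy sketch of one such step (function name, parameter names, and the simplified single-vector gradient representation are illustrative assumptions, not code from the paper):

```python
import numpy as np

def dp_sgd_step(per_example_grads, clip_norm, noise_multiplier, lr, params, rng):
    """One DP-SGD step: clip each example's gradient to `clip_norm`,
    average the clipped gradients, add Gaussian noise scaled by
    `noise_multiplier * clip_norm`, and take a gradient step."""
    clipped = []
    for g in per_example_grads:
        norm = np.linalg.norm(g)
        # Scale down any gradient whose norm exceeds the clipping threshold.
        clipped.append(g * min(1.0, clip_norm / (norm + 1e-12)))
    mean_grad = np.mean(clipped, axis=0)
    # Noise standard deviation follows the usual DP-SGD calibration,
    # divided by the batch size because we noise the *averaged* gradient.
    sigma = noise_multiplier * clip_norm / len(per_example_grads)
    noise = rng.normal(0.0, sigma, size=mean_grad.shape)
    return params - lr * (mean_grad + noise)
```

Note that even when the raw per-example gradients are sparse (as in embedding layers, where only the rows for tokens present in the batch are nonzero), the added noise is dense over the whole parameter vector; this is exactly the sparsity-destroying effect the abstract's DP-FEST and DP-AdaFEST algorithms are designed to avoid.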
Using machine learning to improve health care has gained popularity. However, most research in machi...
Large convolutional neural networks (CNN) can be difficult to train in the differentially private (D...
Training even moderately-sized generative models with differentially-private stochastic gradient des...
Differentially private stochastic gradient descent (DP-SGD) has been widely adopted in deep learning...
Differentially Private (DP) learning has seen limited success for building large deep learning model...
Differentially Private methods for training Deep Neural Networks (DNNs) have progressed recently, in...
Preserving privacy in contemporary NLP models allows us to work with sensitive data, but unfortunate...
Differentially private stochastic gradient descent (DP-SGD) is the workhorse algorithm for recent ad...
While modern machine learning models rely on increasingly large training datasets, data is often lim...
A well-known algorithm in privacy-preserving ML is differentially private stochastic gradient descen...
Training large neural networks with meaningful/usable differential privacy security guarantees is a ...
Existing approaches for training neural networks with user-level differential privacy (e.g., DP Fede...
We give simpler, sparser, and faster algorithms for differentially private fine-tuning of large-scal...
Differentially Private Stochastic Gradient Descent (DP-SGD) is a key method for applying privacy in ...
Protecting large language models from privacy leakage is becoming increasingly crucial with their wi...