Large Language Models have become the core architecture upon which most modern natural language processing (NLP) systems are built. These models consistently deliver impressive accuracy and robustness across tasks and domains, but their high computational overhead makes inference difficult and expensive. To reduce these costs, recent work has explored structured and unstructured pruning, quantization, and distillation as ways to improve inference speed and decrease model size. This paper studies how models pruned with Gradual Unstructured Magnitude Pruning transfer between domains and tasks. Our experiments show that models pruned during pretraining using general-domain masked language model...
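For readers unfamiliar with the technique, the sketch below illustrates gradual unstructured magnitude pruning in PyTorch: the smallest-magnitude weights are zeroed out on a sparsity schedule that ramps up over the course of training, with the resulting masks re-applied between pruning events. The cubic schedule (following Zhu & Gupta, 2017), the helper names, and the PyTorch framing are illustrative assumptions, not the exact implementation used in this paper.

```python
import torch

def sparsity_at_step(step, total_steps, final_sparsity, initial_sparsity=0.0):
    # Cubic sparsity schedule (assumption, following Zhu & Gupta, 2017):
    # sparsity ramps from initial_sparsity to final_sparsity over total_steps.
    progress = min(max(step / total_steps, 0.0), 1.0)
    return final_sparsity + (initial_sparsity - final_sparsity) * (1.0 - progress) ** 3

def magnitude_prune_(weight, sparsity):
    # Zero out the smallest-magnitude entries of `weight` in place so that
    # roughly `sparsity` of its entries are zero; return the binary mask.
    k = int(sparsity * weight.numel())
    if k == 0:
        return torch.ones_like(weight)
    threshold = weight.abs().flatten().kthvalue(k).values
    mask = (weight.abs() > threshold).to(weight.dtype)
    weight.mul_(mask)
    return mask

def gradual_magnitude_pruning_step(model, step, total_steps, final_sparsity,
                                   masks, prune_every=100):
    # Hypothetical training hook: every `prune_every` steps, recompute the
    # target sparsity and re-prune each Linear layer by weight magnitude;
    # otherwise, keep previously pruned weights at zero.
    if step % prune_every == 0:
        target = sparsity_at_step(step, total_steps, final_sparsity)
        for name, module in model.named_modules():
            if isinstance(module, torch.nn.Linear):
                masks[name] = magnitude_prune_(module.weight.data, target)
    else:
        for name, module in model.named_modules():
            if isinstance(module, torch.nn.Linear) and name in masks:
                module.weight.data.mul_(masks[name])
    return masks
```

In a pretraining loop, `gradual_magnitude_pruning_step` would be called after each optimizer update with a shared `masks` dictionary, so the network is trained and pruned jointly rather than pruned once after convergence.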