Widely used language models (LMs) are typically built by scaling up a two-stage training pipeline: a pre-training stage that uses a very large, diverse dataset of text and a fine-tuning (sometimes, 'alignment') stage that uses targeted examples or other specifications of desired behaviors. While it has been hypothesized that knowledge and skills come from pre-training, and fine-tuning mostly filters this knowledge and skillset, this intuition has not been extensively tested. To aid in doing so, we introduce a novel technique for decoupling the knowledge and skills gained in these two stages, enabling a direct answer to the question, "What would happen if we combined the knowledge learned by a large model during pre-training with the knowled...
Recent work has shown that fine-tuning large pre-trained language models on a collection of tasks de...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
In recent years, large language models such as BERT and GPT-2 have revolutionized the field of Natur...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
Adopting a two-stage paradigm of pretraining followed by fine-tuning, Pretrained Language Models (PL...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
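The bias-only idea described above can be illustrated in a few lines. This is a hedged sketch, not the authors' released code: it assumes a Hugging Face Transformers classifier, and the checkpoint name, learning rate, and the choice to also keep the newly initialized task head trainable are illustrative assumptions.

import torch
from transformers import AutoModelForSequenceClassification

# Illustrative checkpoint and task setup (assumptions, not from the abstract).
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# BitFit-style selection: keep only bias terms trainable, plus the task head
# (the head is newly initialized here, so the sketch assumes it is also tuned).
for name, param in model.named_parameters():
    param.requires_grad = name.endswith("bias") or name.startswith("classifier")

trainable = [p for p in model.parameters() if p.requires_grad]
print("trainable parameters:", sum(p.numel() for p in trainable))

# Optimize only the small trainable subset.
optimizer = torch.optim.AdamW(trainable, lr=1e-4)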
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
In this paper, we move towards combining large parametric models with non-parametric prototypical ne...
Language model fine-tuning is essential for modern natural language processing, but is computational...
The pre-training and fine-tuning paradigm has contributed to a number of breakthroughs in Natural La...
Fine-tuning large language models for different tasks can be costly and inefficient, and even method...
Deploying large language models (LLMs) is challenging because they are memory inefficient and comput...
Recent advancements in Large Language Models (LLMs) have enabled the development of a single model c...
Pre-trained language models (PLMs) have demonstrated impressive performance across various downstrea...