The practice of transferring knowledge from a sophisticated, proprietary large language model (LLM) to a compact, open-source LLM has garnered considerable attention. Previous works have focused on unidirectional knowledge distillation, aligning the student model's responses to a set of instructions with those of the teacher model. Nevertheless, they overlooked the possibility of incorporating reciprocal "feedback" (identifying challenging instructions where the student model's performance falls short) to iteratively boost the student model's proficiency. To this end, we propose a novel adversarial distillation framework for more efficient knowledge transfer. Leveraging the versatile role adaptability of LLMs, we prompt t...
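As a rough illustration of the feedback loop sketched in this abstract, the following is a minimal sketch of one distillation round: the student imitates the teacher's responses, instructions where the student falls short are flagged, and new instructions targeting those weaknesses seed the next round. All helper functions here (teacher_respond, student_respond, score_gap, generate_harder_instructions) are hypothetical placeholders for illustration, not the paper's implementation.

```python
# Minimal sketch of one feedback-driven distillation round, assuming
# placeholder helpers in place of real LLM calls and fine-tuning.
from typing import List, Tuple
import random


def teacher_respond(instruction: str) -> str:
    # Placeholder: in practice, query the proprietary teacher LLM.
    return f"[teacher answer to: {instruction}]"


def student_respond(instruction: str) -> str:
    # Placeholder: in practice, query the compact student LLM.
    return f"[student answer to: {instruction}]"


def score_gap(teacher_answer: str, student_answer: str) -> float:
    # Placeholder quality gap in [0, 1): in practice, a referee model
    # rates how far the student's answer falls short of the teacher's.
    return random.random()


def generate_harder_instructions(hard_examples: List[str], n: int) -> List[str]:
    # Placeholder: in practice, prompt the teacher to produce new
    # instructions similar to those the student struggled with.
    return [f"harder variant {i} of: {hard_examples[0]}" for i in range(n)]


def distillation_round(instructions: List[str], gap_threshold: float = 0.5
                       ) -> Tuple[List[Tuple[str, str]], List[str]]:
    """One imitation-plus-feedback round.

    Returns (imitation training pairs for the student, new hard instructions).
    """
    training_pairs: List[Tuple[str, str]] = []   # (instruction, teacher response)
    hard_instructions: List[str] = []            # where the student falls short
    for inst in instructions:
        teacher_out = teacher_respond(inst)
        student_out = student_respond(inst)
        training_pairs.append((inst, teacher_out))
        if score_gap(teacher_out, student_out) > gap_threshold:
            hard_instructions.append(inst)
    # Feedback step: expand the instruction pool with items targeted at the
    # student's weaknesses, to be used in the next round.
    new_instructions = (generate_harder_instructions(hard_instructions, n=4)
                        if hard_instructions else [])
    return training_pairs, new_instructions


if __name__ == "__main__":
    pairs, next_round = distillation_round(["Explain quicksort.", "Prove 2+2=4."])
    print(len(pairs), "imitation pairs;", len(next_round), "new hard instructions")
```

One plausible instantiation would replace the placeholder scorer with a referee prompt to the teacher model and fine-tune the student on the accumulated pairs between rounds, but those details are assumptions beyond what the abstract states.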
Knowledge distillation (KD), best known as an effective method for model compression, aims at transf...
Distillation efforts have led to language models that are more compact and efficient without serious...
Recently, large-scale pre-trained models have shown their advantages in many tasks. However, due to ...
Recently, multi-modal content generation has attracted considerable attention from researchers by investi...
Deep and large pre-trained language models (e.g., BERT, GPT-3) are state-of-the-art for various natu...
This work investigates large language models (LLMs) as teachable agents for learning by teaching (LB...
Large language models (LLMs) have shown incredible performance in completing various real-world task...
Large-scale pretrained language models have led to significant improvements in Natural Language Proc...
Deploying large language models (LLMs) is challenging because they are memory inefficient and comput...
Instruction tuning is instrumental in enabling Large Language Models (LLMs) to follow user instructi...
High-quality instruction-tuning data is critical to improving LLM capabilities. Existing data collec...
Large language models (LLMs) are instruction followers, but it can be challenging to find the best i...
Large language models have become a vital component in modern NLP, achieving state-of-the-art perfor...
Although remarkable progress has been achieved in preventing large language model (LLM) hallucinatio...
In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension a...