Prompting has shown impressive success in enabling large pretrained language models (LMs) to perform diverse NLP tasks, especially when only limited downstream data are available. Automatically finding the optimal prompt for each task, however, is challenging. Most existing work resorts to tuning soft prompts (e.g., embeddings), which fall short of interpretability, reusability across LMs, and applicability when gradients are not accessible. Discrete prompts, on the other hand, are difficult to optimize and are often created by "enumeration (e.g., paraphrasing)-then-selection" heuristics that do not explore the prompt space systematically. This paper proposes RLPrompt, an efficient discrete prompt optimization approach with reinforcement learning ...
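Since the abstract only describes the approach at a high level, the following is a minimal, self-contained sketch of the core idea: a policy network samples discrete prompt tokens and is updated with a policy gradient (REINFORCE) against a black-box reward. Everything here (the `PromptPolicy` class, the toy vocabulary, the synthetic `reward_fn`) is illustrative and not the paper's implementation; in RLPrompt the reward would come from a frozen LM's downstream task performance under the sampled prompt.

```python
import torch
import torch.nn as nn

# Toy setup: the policy picks a fixed-length discrete prompt from a small
# candidate vocabulary. A synthetic reward stands in for the frozen LM's
# task performance so the sketch runs on its own.
VOCAB = ["classify", "sentiment", "review", "movie", "absolutely", "terrible"]
PROMPT_LEN = 3

class PromptPolicy(nn.Module):
    """Autoregressive policy over prompt tokens (illustrative architecture)."""
    def __init__(self, vocab_size, hidden=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size + 1, hidden)  # +1 slot for BOS
        self.rnn = nn.GRU(hidden, hidden, batch_first=True)
        self.head = nn.Linear(hidden, vocab_size)

    def sample(self):
        """Sample one prompt; return token ids and the summed log-probs."""
        tok = torch.tensor([[len(VOCAB)]])  # BOS id
        h, ids, logps = None, [], []
        for _ in range(PROMPT_LEN):
            out, h = self.rnn(self.embed(tok), h)
            dist = torch.distributions.Categorical(logits=self.head(out[:, -1]))
            tok = dist.sample().unsqueeze(0)
            ids.append(tok.item())
            logps.append(dist.log_prob(tok.squeeze(0)))
        return ids, torch.stack(logps).sum()

def reward_fn(ids):
    # Stand-in for downstream task accuracy of the frozen LM under this
    # prompt: pretend the target prompt "classify sentiment review" is best.
    target = [0, 1, 2]
    return float(sum(a == b for a, b in zip(ids, target))) / PROMPT_LEN

policy = PromptPolicy(len(VOCAB))
opt = torch.optim.Adam(policy.parameters(), lr=1e-2)
baseline = 0.0  # moving-average baseline to reduce gradient variance

for step in range(500):
    ids, logp = policy.sample()
    r = reward_fn(ids)
    baseline = 0.95 * baseline + 0.05 * r
    loss = -(r - baseline) * logp  # REINFORCE objective
    opt.zero_grad()
    loss.backward()
    opt.step()

print("learned prompt:", " ".join(VOCAB[i] for i in ids))
```

Because the reward is treated as a black box, this style of optimization needs no gradients through the LM, which is what makes discrete prompt search applicable to API-only models, as the abstract notes.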
Pretrained language models (PLMs) have demonstrated remarkable performance in various natural langua...
Pretrained language models (PLMs) have made remarkable progress in text generation tasks via fine-tu...
Why can pre-trained language models (PLMs) learn universal representations and effectively adapt to ...
Reinforcement learning (RL) has emerged as a powerful paradigm for fine-tuning Large Language Models...
Recent works have shown promising results of prompt tuning in stimulating pre-trained language model...
Prompt-based fine-tuning has boosted the performance of Pre-trained Language Models (PLMs) on few-sh...
In recent years, there has been significant progress in developing pre-trained language models for N...
We tackle the problem of aligning pre-trained large language models (LMs) with human preferences. If...
Prompt tuning learns soft prompts to condition frozen Pre-trained Language Models (PLMs) for perform...
Large-scale pre-trained language models have contributed significantly to natural language processin...
Reinforcement learning (RL) has been widely used to aid training in language generation. This is ach...
Prompts have been shown to be an effective method to adapt a frozen Pretrained Language Model (PLM) ...
Usage of reinforcement learning (RL) in natural language processing (NLP) tasks has gained momentum ...
Maximum likelihood estimation (MLE) is the predominant algorithm for training text generation models...
Text style transfer is an important task in controllable language generation. Supervised approaches ...