In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension and generation tasks, with the following main findings. First, in the zero-shot setting, instructed LLMs are highly competitive on code comprehension and generation tasks, sometimes even outperforming small SOTA models fine-tuned specifically for each downstream task. We also find that larger instructed LLMs are not always better on code-related tasks. Second, in the few-shot setting, adding demonstration examples substantially helps instructed LLMs on most code comprehension and generation tasks, though the examples can sometimes induce unstable or even worse performance. Furthermore, we find widely-used BM2...
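The truncated final sentence appears to reference a BM25-based technique. As a minimal, purely illustrative sketch of one common way demonstration examples are selected for few-shot prompting, the snippet below ranks a candidate pool by BM25 lexical similarity to the query; the function names, the toy pool, and the prompt format are all hypothetical, not taken from the paper.

```python
import math
from collections import Counter

def bm25_scores(query_tokens, corpus_tokens, k1=1.5, b=0.75):
    """Score each document in corpus_tokens against the query with BM25.

    corpus_tokens: list of token lists (the candidate demonstration pool).
    Returns one score per document; higher means more lexically similar.
    """
    n_docs = len(corpus_tokens)
    avg_len = sum(len(d) for d in corpus_tokens) / n_docs
    # Document frequency of each term across the pool.
    df = Counter(t for doc in corpus_tokens for t in set(doc))
    scores = []
    for doc in corpus_tokens:
        tf = Counter(doc)
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            score += idf * tf[term] * (k1 + 1) / (
                tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            )
        scores.append(score)
    return scores

def select_demonstrations(query, pool, k=3):
    """Pick the k pool examples most lexically similar to the query."""
    corpus_tokens = [ex["input"].split() for ex in pool]
    scores = bm25_scores(query.split(), corpus_tokens)
    ranked = sorted(range(len(pool)), key=lambda i: scores[i], reverse=True)
    return [pool[i] for i in ranked[:k]]

# Illustrative usage: build a few-shot prompt for a code summarization query.
pool = [
    {"input": "def add(a, b): return a + b", "output": "Add two numbers."},
    {"input": "def read_file(p): return open(p).read()", "output": "Read a file."},
    {"input": "def mul(a, b): return a * b", "output": "Multiply two numbers."},
]
query = "def sub(a, b): return a - b"
demos = select_demonstrations(query, pool, k=2)
prompt = "\n\n".join(f"Code: {d['input']}\nSummary: {d['output']}" for d in demos)
prompt += f"\n\nCode: {query}\nSummary:"
```

Lexical retrieval of this kind is a common baseline for demonstration selection; dense-embedding retrievers are a frequent alternative to the BM25 ranking shown here.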
High-quality instruction-tuning data is critical to improving LLM capabilities. Existing data collec...
Large language models (LLMs) have demonstrated significant potential in the realm of natural languag...
The automation of code review activities, a long-standing pursuit in software engineering, has been ...
Large language models (LLMs) are instruction followers, but it can be challenging to find the best i...
Recently, instruction fine-tuning has risen to prominence as a potential method for enhancing the ze...
Very large language models (LLMs), such as GPT-3 and Codex, have achieved state-of-the-art performanc...
Recent work has shown that fine-tuning large pre-trained language models on a collection of tasks de...
Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnabl...
This paper systematically investigates the generation of code explanations by Large Language Models ...
Large Language Models (LLMs) are a new class of computation engines, "programmed" via prompt engineer...
Large Language Models (LLMs) for code are a family of high-parameter, transformer-based neural netwo...
Multi-task learning (MTL), instruction tuning, and prompting have recently been shown to improve the...
A key technology for the development of large language models (LLMs) involves instruction tuning tha...
Pretrained large language models (LLMs) are strong in-context learners that are able to perform few-...
One of the most common solutions adopted by software researchers to address code generation is by tr...