Fully Autonomous Programming with Large Language Models

Liventsev, Vadim
Grishina, Anastasiia
Härmä, Aki
Moonen, Leon

Publication date

April 2023

Publisher

Cornell University - arXiv

Language

English

Abstract

Current approaches to program synthesis with Large Language Models (LLMs) exhibit a "near miss syndrome": they tend to generate programs that semantically resemble the correct answer (as measured by text similarity metrics or human evaluation), but achieve a low or even zero accuracy as measured by unit tests due to small imperfections, such as the wrong input or output format. This calls for an approach known as Synthesize, Execute, Debug (SED), whereby a draft of the solution is generated first, followed by a program repair phase addressing the failed tests. To effectively apply this approach to instruction-driven LLMs, one needs to determine which prompts perform best as instructions for LLMs, as well as strike a balance between repairin...
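The Synthesize, Execute, Debug loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `run_tests`, `sed`, `toy_synthesize`, and `toy_debug` are hypothetical, the two toy functions stand in for calls to an instruction-driven LLM, and the fixed repair budget `max_repairs` is only one simple stopping rule. The toy draft reproduces a "near miss": it computes the right value but returns it in the wrong output format.

```python
from typing import Callable, List, Optional, Tuple

Test = Tuple[tuple, object]  # (arguments, expected result)


def run_tests(source: str, tests: List[Test]) -> List[str]:
    """Execute step: load `source` and describe every failed unit test."""
    namespace: dict = {}
    try:
        exec(source, namespace)       # the candidate must define `solve`
        solve = namespace["solve"]
    except Exception as err:
        return [f"program failed to load: {err!r}"]
    failures = []
    for args, expected in tests:
        try:
            actual = solve(*args)
        except Exception as err:
            failures.append(f"solve{args} raised {err!r}")
            continue
        if actual != expected:
            failures.append(f"solve{args} returned {actual!r}, expected {expected!r}")
    return failures


def sed(task: str,
        tests: List[Test],
        synthesize: Callable[[str], str],
        debug: Callable[[str, str, List[str]], str],
        max_repairs: int = 3) -> Optional[str]:
    """Synthesize a draft, then repair it until the tests pass or the budget runs out."""
    program = synthesize(task)                    # Synthesize
    for _ in range(max_repairs):
        failures = run_tests(program, tests)      # Execute
        if not failures:
            return program
        program = debug(task, program, failures)  # Debug
    return program if not run_tests(program, tests) else None


# Hypothetical stand-ins for LLM calls (assumptions for illustration only).
def toy_synthesize(task: str) -> str:
    # Near miss: correct arithmetic, wrong output format (str instead of int).
    return "def solve(a, b):\n    return str(a + b)\n"


def toy_debug(task: str, program: str, failures: List[str]) -> str:
    # A real repair step would pass `program` and `failures` to the model.
    return "def solve(a, b):\n    return a + b\n"


TESTS: List[Test] = [((1, 2), 3), ((0, 0), 0)]
repaired = sed("add two integers", TESTS, toy_synthesize, toy_debug)
```

Here the draft fails both tests because of its output format, and a single repair step is enough for `sed` to return a passing program. If no repair succeeds within the budget, `sed` returns `None`.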
