Current approaches to program synthesis with Large Language Models (LLMs) exhibit a "near miss syndrome": they tend to generate programs that semantically resemble the correct answer (as measured by text similarity metrics or human evaluation), but achieve a low or even zero accuracy as measured by unit tests due to small imperfections, such as the wrong input or output format. This calls for an approach known as Synthesize, Execute, Debug (SED), whereby a draft of the solution is generated first, followed by a program repair phase addressing the failed tests. To effectively apply this approach to instruction-driven LLMs, one needs to determine which prompts perform best as instructions for LLMs, as well as strike a balance between repairin...
Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tas...
This repository contains the replication package for the paper "Fully Autonomous Programming with La...
In this thesis, we explore techniques for the development of recursive functional programs over unbo...
This paper provides a survey of the emerging area of Large Language Models (LLMs) for Software Engin...
During Automated Program Repair (APR), it can be challenging to synthesize correct patches for real-...
Program synthesis strives to generate a computer program as a solution to a given problem specificat...
Software testing is an important part of the development cycle, yet it requires specialized expertis...
In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension a...
Large Language Models (LLMs) have significantly impacted numerous domains, including Software Engine...
Sequence-to-sequence models have been used to transform erroneous programs into correct ones when tr...
Unit tests play a key role in ensuring the correctness of software. However, manually creating unit ...
Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on hum...
Artifact (Docker Container) for our ESEC/FSE'23 Paper: "Copiloting the Copilots: Fusing Large Langua...
This article explores the natural language generation capabilities of large language models with app...
Large language models (LLMs) have recently been integrated in a variety of applications including so...
Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tas...
This repository contains the replication package for the paper "Fully Autonomous Programming with La...
In this thesis, we explore techniques for the development of recursive functional programs over unbo...
This paper provides a survey of the emerging area of Large Language Models (LLMs) for Software Engin...
During Automated Program Repair (APR), it can be challenging to synthesize correct patches for real-...
Program synthesis strives to generate a computer program as a solution to a given problem specificat...
Software testing is an important part of the development cycle, yet it requires specialized expertis...
In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension a...
Large Language Models (LLMs) have significantly impacted numerous domains, including Software Engine...
Sequence-to-sequence models have been used to transform erroneous programs into correct ones when tr...
Unit tests play a key role in ensuring the correctness of software. However, manually creating unit ...
Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on hum...
Artifact (Docker Container) for our ESEC/FSE'23 Paper: "Copiloting the Copilots: Fusing Large Langua...
This article explores the natural language generation capabilities of large language models with app...
Large language models (LLMs) have recently been integrated in a variety of applications including so...
Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tas...
This repository contains the replication package for the paper "Fully Autonomous Programming with La...
In this thesis, we explore techniques for the development of recursive functional programs over unbo...