Generative models of code, pretrained on large corpora of programs, have shown great success in translating natural language to code (Chen et al., 2021; Austin et al., 2021; Li et al., 2022, inter alia). While these models do not explicitly incorporate program semantics (i.e., execution results) during training, they are able to generate correct solutions for many problems. However, choosing a single correct program from a generated set for each problem remains challenging. In this work, we introduce execution result--based minimum Bayes risk decoding (MBR-EXEC) for program selection and show that it improves the few-shot performance of pretrained code models on natural-language-to-code tasks. We select output programs from a generated cand...
Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on hum...
Statistical Machine Translation (SMT) has gained enormous popularity in recent years as natural lan...
Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic ca...
The advent of large language models trained on code (code LLMs) has led to significant progress in l...
Large language models are becoming increasingly practical for translating code across programming la...
Using natural language to write programs is a touchstone problem for computational linguistics. We p...
Research at the intersection of machine learning, programming languages, and software engineering ha...
Source-to-source code translation automatically translates a program from one programming language t...
Few-shot learning with large-scale, pre-trained language models is a powerful way to answer question...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Natural languages like English are rich, complex, and powerful. The highly creative and graceful use...
Large language models have demonstrated the ability to condition on and generate both natural langua...
peer reviewedMuch of recent software-engineering research has investigated the naturalness of code, ...
Reasoning over natural language is a long-standing goal for the research community. However, studies...
Software version migration and program translation are an important and costly part of the lifecycle...
Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on hum...
Statistical Machine Translation (SMT) has gained enormous popularity in recent years as natural lan...
Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic ca...
The advent of large language models trained on code (code LLMs) has led to significant progress in l...
Large language models are becoming increasingly practical for translating code across programming la...
Using natural language to write programs is a touchstone problem for computational linguistics. We p...
Research at the intersection of machine learning, programming languages, and software engineering ha...
Source-to-source code translation automatically translates a program from one programming language t...
Few-shot learning with large-scale, pre-trained language models is a powerful way to answer question...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Natural languages like English are rich, complex, and powerful. The highly creative and graceful use...
Large language models have demonstrated the ability to condition on and generate both natural langua...
peer reviewedMuch of recent software-engineering research has investigated the naturalness of code, ...
Reasoning over natural language is a long-standing goal for the research community. However, studies...
Software version migration and program translation are an important and costly part of the lifecycle...
Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on hum...
Statistical Machine Translation (SMT) has gained enormous popularity in recent years as natural lan...
Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic ca...