Recently, scores of high-performing code generation systems have surfaced. As has become a popular choice in many domains, code generation is often approached using large language models as a core, trained under the masked or causal language modeling schema. This work shows that current code generation systems exhibit biases inherited from large language model backbones, which might leak into generated code under specific circumstances. To investigate the effect, we propose a framework that automatically removes hints and exposes various biases that these code generation models use. We apply our framework to three coding challenges and test it across top-performing coding generation models. Our experiments reveal biases towards specific p...
Neural language models often fail to generate diverse and informative texts, limiting their applicab...
Current language generation models suffer from issues such as repetition, incoherence, and hallucina...
Predictive modeling using machine learning is an effective method for building compiler heuristics, ...
Utilizing state-of-the-art Large Language Models (LLMs), automatic code generation models play a piv...
Large language models generate complex, open-ended outputs: instead of outputting a class label they...
Data generation and analysis is a fundamental aspect of many industries and disciplines, from strate...
Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on hum...
Few-shot learning with large-scale, pre-trained language models is a powerful way to answer question...
Thesis (Ph.D.)--University of Washington, 2020Modern machine learning algorithms have been able to a...
Large language models (LLMs) have garnered significant attention for their remarkable performance in...
Large Transformer models achieved the state-of-the-art status for Natural Language Understanding tas...
This paper systematically investigates the generation of code explanations by Large Language Models ...
Neural language models are increasingly deployed into APIs and websites that allow a user to pass in...
Developing models that can automatically generate detailed code explanation can greatly benefit soft...
One of the most common solutions adopted by software researchers to address code generation is by tr...
Neural language models often fail to generate diverse and informative texts, limiting their applicab...
Current language generation models suffer from issues such as repetition, incoherence, and hallucina...
Predictive modeling using machine learning is an effective method for building compiler heuristics, ...
Utilizing state-of-the-art Large Language Models (LLMs), automatic code generation models play a piv...
Large language models generate complex, open-ended outputs: instead of outputting a class label they...
Data generation and analysis is a fundamental aspect of many industries and disciplines, from strate...
Recent Language Models (LMs) achieve breakthrough performance in code generation when trained on hum...
Few-shot learning with large-scale, pre-trained language models is a powerful way to answer question...
Thesis (Ph.D.)--University of Washington, 2020Modern machine learning algorithms have been able to a...
Large language models (LLMs) have garnered significant attention for their remarkable performance in...
Large Transformer models achieved the state-of-the-art status for Natural Language Understanding tas...
This paper systematically investigates the generation of code explanations by Large Language Models ...
Neural language models are increasingly deployed into APIs and websites that allow a user to pass in...
Developing models that can automatically generate detailed code explanation can greatly benefit soft...
One of the most common solutions adopted by software researchers to address code generation is by tr...
Neural language models often fail to generate diverse and informative texts, limiting their applicab...
Current language generation models suffer from issues such as repetition, incoherence, and hallucina...
Predictive modeling using machine learning is an effective method for building compiler heuristics, ...