With the prevalence of publicly available source code repositories to train deep neural network models, neural program models can do well in source code analysis tasks such as predicting method names in given programs that cannot be easily done by traditional program analysis techniques. Although such neural program models have been tested on various existing datasets, the extent to which they generalize to unforeseen source code is largely unknown. Since it is very challenging to test neural program models on all unforeseen programs, in this paper, we propose to evaluate the generalizability of neural program models with respect to semantic-preserving transformations: a generalizable neural program model should perform equally well on prog...
Over-paramaterized neural models have become dominant in Natural Language Processing. Increasing the...
Neural code intelligence (CI) models are opaque black-boxes and offer little insight on the features...
In recent years, millions of source codes are generated in different languages on a daily basis all ...
Context: With the prevalence of publicly available source code repositories to train deep neural net...
The abundance of publicly available source code repositories, in conjunction with the advances in ne...
Deep Neural Networks (DNNs) are increasingly being used in software engineering and code intelligenc...
Pre-trained code generation models (PCGMs) have been widely applied in neural code generation which ...
When writing programs, people have the ability to tackle a new complex task by decomposing it into s...
Program synthesis, or automatically writing programs from high-level specifications has been a long-...
Reimplementing solutions to previously solved problems is not only inefficient but also introduces i...
The ubiquitousness of software in modern society and the boom in open-source software have made soft...
Programming is a task that has accompanied all computer scientists since as early as the vacuum tube...
Coding conventions are ubiquitous in software engineering practice. Maintaining a uniform coding st...
While neural network-based models have achieved impressive performance on a large body of NLP tasks,...
The application of machine learning (ML) and natural language processing (NLP) methods for creating...
Over-paramaterized neural models have become dominant in Natural Language Processing. Increasing the...
Neural code intelligence (CI) models are opaque black-boxes and offer little insight on the features...
In recent years, millions of source codes are generated in different languages on a daily basis all ...
Context: With the prevalence of publicly available source code repositories to train deep neural net...
The abundance of publicly available source code repositories, in conjunction with the advances in ne...
Deep Neural Networks (DNNs) are increasingly being used in software engineering and code intelligenc...
Pre-trained code generation models (PCGMs) have been widely applied in neural code generation which ...
When writing programs, people have the ability to tackle a new complex task by decomposing it into s...
Program synthesis, or automatically writing programs from high-level specifications has been a long-...
Reimplementing solutions to previously solved problems is not only inefficient but also introduces i...
The ubiquitousness of software in modern society and the boom in open-source software have made soft...
Programming is a task that has accompanied all computer scientists since as early as the vacuum tube...
Coding conventions are ubiquitous in software engineering practice. Maintaining a uniform coding st...
While neural network-based models have achieved impressive performance on a large body of NLP tasks,...
The application of machine learning (ML) and natural language processing (NLP) methods for creating...
Over-paramaterized neural models have become dominant in Natural Language Processing. Increasing the...
Neural code intelligence (CI) models are opaque black-boxes and offer little insight on the features...
In recent years, millions of source codes are generated in different languages on a daily basis all ...