peer reviewedNatural language processing techniques, in particular n-gram models, have been applied successfully to facilitate a number of software engineering tasks. However, in our related ICSME ’18 paper, we have shown that the conclusions of a study can drastically change with respect to how the code is tokenized and how the used n-gram model is parameterized. These choices are thus of utmost importance, and one must carefully make them. To show this and allow the community to benefit from our work, we have developed TUNA (TUning Naturalness-based Analysis), a Java software artifact to perform naturalness-based analyses of source code. To the best of our knowledge, TUNA is the first open- source, end-to-end toolchain to carry out source...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
International audienceOne common task of developing or maintaining software is searching the source ...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
Natural languages like English are rich, complex, and powerful. The highly creative and graceful use...
peer reviewedMuch of recent software-engineering research has investigated the naturalness of code, ...
Research at the intersection of machine learning, programming languages, and software engineering ha...
peer reviewedBackground: Code is repetitive and predictable in a way that is similar to the natural...
Recent research shows that language models, such as n-gram models, are useful at a wide variety of s...
We live in a time where software is used everywhere. It is used even for creating other software by ...
Background: Code is repetitive and predictable in a way that is similar to the natural language. Thi...
Real software, the kind working programmers produce by the kLOC to solve real-world problems, tends ...
Analyzing source code using computational linguistics and exploiting the linguistic properties of so...
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for...
Software projects are continually evolving, as developers incorporate changes to refactor code, supp...
Thesis (Ph.D.)--University of Washington, 2019Models that automatically map natural language (NL) to...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
International audienceOne common task of developing or maintaining software is searching the source ...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
Natural languages like English are rich, complex, and powerful. The highly creative and graceful use...
peer reviewedMuch of recent software-engineering research has investigated the naturalness of code, ...
Research at the intersection of machine learning, programming languages, and software engineering ha...
peer reviewedBackground: Code is repetitive and predictable in a way that is similar to the natural...
Recent research shows that language models, such as n-gram models, are useful at a wide variety of s...
We live in a time where software is used everywhere. It is used even for creating other software by ...
Background: Code is repetitive and predictable in a way that is similar to the natural language. Thi...
Real software, the kind working programmers produce by the kLOC to solve real-world problems, tends ...
Analyzing source code using computational linguistics and exploiting the linguistic properties of so...
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for...
Software projects are continually evolving, as developers incorporate changes to refactor code, supp...
Thesis (Ph.D.)--University of Washington, 2019Models that automatically map natural language (NL) to...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
International audienceOne common task of developing or maintaining software is searching the source ...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...