Recent years have seen the successful application of large pre-trained models to code representation learning, resulting in substantial improvements on many code-related downstream tasks. However, there are issues surrounding their application to software engineering (SE) tasks. First, the majority of pre-trained models focus on pre-training only the encoder of the Transformer; for generation tasks addressed with encoder-decoder architectures, however, there is no reason the decoder should be left out of pre-training. Second, many existing pre-trained models, including state-of-the-art models such as T5-learning, simply reuse the pre-training tasks designed for natural languages. Moreover, to learn the natural language description ...
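To make the first point concrete, below is a minimal sketch (not from the paper) of one sequence-to-sequence denoising pre-training step using the Hugging Face transformers library: because the loss is computed from teacher-forced decoder outputs, gradients flow through both the encoder and the decoder. The t5-small checkpoint, the toy code snippet, and the sentinel-token masking are illustrative assumptions, not the paper's actual setup.

```python
# Minimal sketch: seq2seq denoising pre-training, where the cross-entropy
# loss back-propagates through BOTH the encoder and the decoder.
# The checkpoint and the masked example are illustrative assumptions.
from transformers import AutoTokenizer, T5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

# Corrupted source: a code snippet with a masked span;
# target: the content of the missing span.
source = "def add(a, b): return <extra_id_0>"
target = "<extra_id_0> a + b"

inputs = tokenizer(source, return_tensors="pt")
labels = tokenizer(target, return_tensors="pt").input_ids

# Teacher-forced decoding: the loss updates encoder and decoder jointly,
# unlike encoder-only objectives such as masked language modeling.
loss = model(**inputs, labels=labels).loss
loss.backward()
```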
We provide the datasets for reproducing the experiments in "Code Execution with Pre-trained Language...
Datasets for the paper "Bridging Pre-trained Models and Downstream Tasks for Source Code Understandi...
Few-shot learning with large-scale, pre-trained language models is a powerful way to answer question...
An effective and efficient code representation is critical to the success of sequence-to-sequence de...
Back-translation is widely known for its effectiveness for neural machine translation when little to...
Representation learning of source code is essential for applying machine learning to software engine...
Code generation is a longstanding challenge, aiming to generate a code snippet based on a natural la...
Recently, many pre-trained language models for source code have been proposed to model the context o...
Machine-learning models can reach very high performance with supervised training, where they learn f...
Structure prediction (SP) tasks are important in natural language understanding in the sense that th...
We study the problem of building generative models of natural source code (NSC); that is, source cod...
There has been a recent surge of interest in automating software engineering tasks using deep learni...
Training a deep learning model on source code has gained significant traction recently. Since such m...
Learning code representations has found many uses in software engineering, such as code classificati...
An effective and efficient encoding of the source code of a computer program is critical to the succ...