Assemble Foundation Models for Automatic Code Summarization

Cited by: 12
Authors
Gu, Jian [1]
Salza, Pasquale [1]
Gall, Harald C. [1]
Affiliations
[1] Univ Zurich, Zurich, Switzerland
Keywords
transfer learning; adaptive scheme; Transformer; Gaussian noise; code summarization
DOI
10.1109/SANER53432.2022.00112
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Automatic code summarization benefits software development and maintenance because it reduces the burden of manual tasks. Artificial intelligence is currently undergoing a paradigm shift: foundation models pretrained on massive data and finetuned to downstream tasks surpass specially customized models. This trend inspired us to reuse foundation models instead of learning from scratch. We therefore propose a flexible and robust neural approach to automatic code summarization. We assemble available foundation models, such as CodeBERT and GPT-2, into a single model named AdaMo. Moreover, we use Gaussian noise to simulate contextual information and thereby optimize the latent representation. Furthermore, we introduce two adaptive schemes from the perspective of knowledge transfer, namely continuous pretraining and intermediate finetuning, and design intermediate-stage tasks for general sequence-to-sequence learning. Finally, we evaluate AdaMo on a benchmark dataset for code summarization, comparing it with state-of-the-art models.
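The abstract does not say where or how the Gaussian noise is applied, so the following is only a minimal sketch of the general idea of perturbing a latent representation with zero-mean Gaussian noise. The function name `add_gaussian_noise`, the `std` and `seed` parameters, and the toy `latent` matrix are all hypothetical and are not taken from the paper or from AdaMo.

```python
import random

def add_gaussian_noise(latent, std=0.1, seed=None):
    """Return a copy of `latent` with zero-mean Gaussian noise added.

    `latent` is a list of vectors (one per token position); `std` is an
    assumed noise scale, not a value reported for AdaMo.
    """
    rng = random.Random(seed)
    return [[x + rng.gauss(0.0, std) for x in row] for row in latent]

# Hypothetical encoder output: 3 token positions, 4 hidden dimensions.
latent = [
    [0.5, -1.0, 0.25, 0.0],
    [1.5, 0.75, -0.5, 2.0],
    [0.0, 0.0, 1.0, -1.0],
]

noisy = add_gaussian_noise(latent, std=0.1, seed=0)  # perturbed copy
clean = add_gaussian_noise(latent, std=0.0)          # std=0 leaves values unchanged
```

In a setup like the one the abstract describes, such a perturbed representation would presumably be what the encoder hands to the decoder, but that wiring is an assumption here.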
Pages: 935-946
Page count: 12