Towards Efficient Fine-Tuning of Pre-trained Code Models: An Experimental Study and Beyond

被引：2

作者：

Shi, Ensheng ^{[1
,5
]}

Wang, Yanlin ^{[2
,5
]}

Zhang, Hongyu ^{[3
]}

Du, Lun ^{[4
]}

Han, Shi ^{[4
]}

Zhang, Dongmei ^{[4
]}

Sun, Hongbin ^{[1
]}

机构：

[1] Xi An Jiao Tong Univ, Xian, Peoples R China

[2] Sun Yat Sen Univ, Zhuhai, Peoples R China

[3] Chongqing Univ, Chongqing, Peoples R China

[4] Microsoft, Beijing, Peoples R China

[5] Microsoft Res Asia, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023 | 2023年

基金：

国家重点研发计划;

关键词：

Empirical study; Pre-Trained Language Models; Efficient Fine-tuning; Probing Techniques; Representational Similarity Analysis;

D O I：

10.1145/3597926.3598036

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Recently, fine-tuning pre-trained code models such as CodeBERT on downstream tasks has achieved great success in many software testing and analysis tasks. While effective and prevalent, fine-tuning the pre-trained parameters incurs a large computational cost. In this paper, we conduct an extensive experimental study to explore what happens to layer-wise pre-trained representations and their encoded code knowledge during fine-tuning. We then propose efficient alternatives to fine-tune the large pre-trained code model based on the above findings. Our experimental study shows that (1) lexical, syntactic and structural properties of source code are encoded in the lower, intermediate, and higher layers, respectively, while the semantic property spans across the entire model. (2) The process of fine-tuning preserves most of the code properties. Specifically, the basic code properties captured by lower and intermediate layers are still preserved during fine-tuning. Furthermore, we find that only the representations of the top two layers change most during fine-tuning for various downstream tasks. (3) Based on the above findings, we propose Telly to efficiently fine-tune pre-trained code models via layer freezing. The extensive experimental results on five various downstream tasks demonstrate that training parameters and the corresponding time cost are greatly reduced, while performances are similar or better.

引用

页码：39 / 51

页数：13

共 50 条

[1] An Empirical Study of Parameter-Efficient Fine-Tuning Methods for Pre-trained Code Models
Liu, Jiaxing
Sha, Chaofeng
Peng, Xin
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 397 - 408
[2] Debiasing Pre-Trained Language Models via Efficient Fine-Tuning
Gira, Michael
Zhang, Ruisu
Lee, Kangwook
PROCEEDINGS OF THE SECOND WORKSHOP ON LANGUAGE TECHNOLOGY FOR EQUALITY, DIVERSITY AND INCLUSION (LTEDI 2022), 2022, : 59 - 69
[3] Span Fine-tuning for Pre-trained Language Models
Bao, Rongzhou
Zhang, Zhuosheng
Zhao, Hai
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1970 - 1979
[4] Parameter-efficient fine-tuning of pre-trained code models for just-in-time defect prediction
Abu Talib M.
Bou Nassif A.
Azzeh M.
Alesh Y.
Afadar Y.
Neural Computing and Applications, 36 (27) : 16911 - 16940
[5] Fine-Tuning Pre-Trained CodeBERT for Code Search in Smart Contract
JIN Huan
LI Qinying
Wuhan University Journal of Natural Sciences, 2023, 28 (03) : 237 - 245
[6] Towards Fine-tuning Pre-trained Language Models with Integer Forward and Backward Propagation
Tayaranian, Mohammadreza
Ghaffari, Alireza
Tahaei, Marzieh S.
Rezagholizadeh, Mehdi
Asgharian, Masoud
Nia, Vahid Partovi
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1912 - 1921
[7] An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained Language Models
Liu, Xueqing
Wang, Chi
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 2286 - 2300
[8] Exploiting Syntactic Information to Boost the Fine-tuning of Pre-trained Models
Liu, Chaoming
Zhu, Wenhao
Zhang, Xiaoyu
Zhai, Qiuhong
2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 575 - 582
[9] Parameter-efficient fine-tuning of large-scale pre-trained language models
Ning Ding
Yujia Qin
Guang Yang
Fuchao Wei
Zonghan Yang
Yusheng Su
Shengding Hu
Yulin Chen
Chi-Min Chan
Weize Chen
Jing Yi
Weilin Zhao
Xiaozhi Wang
Zhiyuan Liu
Hai-Tao Zheng
Jianfei Chen
Yang Liu
Jie Tang
Juanzi Li
Maosong Sun
Nature Machine Intelligence, 2023, 5 : 220 - 235
[10] Parameter-efficient fine-tuning of large-scale pre-trained language models
Ding, Ning
Qin, Yujia
Yang, Guang
Wei, Fuchao
Yang, Zonghan
Su, Yusheng
Hu, Shengding
Chen, Yulin
Chan, Chi-Min
Chen, Weize
Yi, Jing
Zhao, Weilin
Wang, Xiaozhi
Liu, Zhiyuan
Zheng, Hai-Tao
Chen, Jianfei
Liu, Yang
Tang, Jie
Li, Juanzi
Sun, Maosong
NATURE MACHINE INTELLIGENCE, 2023, 5 (03) : 220 - +

← 1 2 3 4 5 →