UniXcoder: Unified Cross-Modal Pre-training for Code Representation

Cited by: 0
Authors
Guo, Daya [1 ,5 ]
Lu, Shuai [3 ]
Duan, Nan [3 ]
Wang, Yanlin [2 ]
Zhou, Ming [4 ]
Yin, Jian [1 ]
Affiliations
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangdong Key Lab Big Data Anal & Proc, Guangzhou, Peoples R China
[2] Sun Yat Sen Univ, Sch Software Engn, Guangzhou, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
[4] Langboat Technol, Beijing, Peoples R China
[5] Microsoft Res, Redmond, WA USA
Funding
National Natural Science Foundation of China
DOI
Not available
CLC number
TP18 [Artificial Intelligence Theory]
Discipline codes
081104; 0812; 0835; 1405
Abstract
Pre-trained models for programming languages have recently demonstrated great success in code intelligence. To support both code-related understanding and generation tasks, recent works attempt to pre-train unified encoder-decoder models. However, such an encoder-decoder framework is sub-optimal for auto-regressive tasks, especially code completion, which requires a decoder-only manner for efficient inference. In this paper, we present UniXcoder, a unified cross-modal pre-trained model for programming languages. The model utilizes mask attention matrices with prefix adapters to control its behavior, and leverages cross-modal content such as ASTs and code comments to enhance code representation. To encode an AST, which is a tree, in parallel, we propose a one-to-one mapping method that transforms the AST into a sequence retaining all of the tree's structural information. Furthermore, we propose to utilize multi-modal content to learn representations of code fragments with contrastive learning, and then align representations across programming languages using a cross-modal generation task. We evaluate UniXcoder on five code-related tasks over nine datasets. To further evaluate the quality of code-fragment representations, we also construct a dataset for a new task, zero-shot code-to-code search. Results show that our model achieves state-of-the-art performance on most tasks, and analysis reveals that comments and ASTs both enhance UniXcoder.
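To make the mode-switching mechanism concrete, here is a minimal PyTorch sketch of how masked attention matrices can realize encoder-only, decoder-only, and encoder-decoder behavior within a single model, as the abstract describes. The function name build_mode_mask, the additive 0/-inf mask convention, and the explicit src_len split are illustrative assumptions, not details taken from the paper (which selects the mode with learned prefix tokens).

import torch

def build_mode_mask(mode: str, seq_len: int, src_len: int = 0) -> torch.Tensor:
    """Additive self-attention mask: 0 = may attend, -inf = blocked.

    Hypothetical sketch of UniXcoder-style mode control; src_len is only
    used in encoder-decoder mode to mark the bidirectional source span.
    """
    neg_inf = float("-inf")
    if mode == "encoder-only":
        # Fully bidirectional: every token attends to every token.
        return torch.zeros(seq_len, seq_len)
    if mode == "decoder-only":
        # Causal: token i attends only to tokens j <= i.
        return torch.triu(torch.full((seq_len, seq_len), neg_inf), diagonal=1)
    if mode == "encoder-decoder":
        # Source attends bidirectionally within the source; the target
        # attends causally to itself and fully to the source.
        tgt_len = seq_len - src_len
        mask = torch.full((seq_len, seq_len), neg_inf)
        mask[:src_len, :src_len] = 0.0  # source <-> source
        mask[src_len:, :src_len] = 0.0  # target -> source
        mask[src_len:, src_len:] = torch.triu(
            torch.full((tgt_len, tgt_len), neg_inf), diagonal=1
        )  # target -> target, causal
        return mask
    raise ValueError(f"unknown mode: {mode}")

# Usage: add the mask to raw attention scores before the softmax.
scores = torch.randn(8, 8)  # toy (seq_len x seq_len) attention scores
masked = scores + build_mode_mask("encoder-decoder", seq_len=8, src_len=3)
probs = torch.softmax(masked, dim=-1)  # blocked positions receive weight 0

Because all three modes share one set of transformer weights and differ only in the mask (plus the prefix tokens), the same pre-trained model can serve understanding tasks, sequence-to-sequence generation, and efficient decoder-only code completion.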
Pages: 7212-7225
Page count: 14
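The one-to-one AST mapping mentioned in the abstract can likewise be sketched. The Node structure and the <left>/<right> marker spelling below are hypothetical; the key property, which the paper requires, is that the flattening is invertible, i.e., the original tree is fully recoverable from the token sequence.

from dataclasses import dataclass, field
from typing import List

@dataclass
class Node:
    name: str
    children: List["Node"] = field(default_factory=list)

def flatten_ast(node: Node) -> List[str]:
    """Invertible flattening: a leaf emits its token; a non-leaf wraps its
    children in matching boundary markers, so the tree can be rebuilt by
    pairing each <left> marker with its <right> counterpart."""
    if not node.children:
        return [node.name]  # leaf: the code token itself
    seq = [f"{node.name}<left>"]  # hypothetical marker spelling
    for child in node.children:
        seq.extend(flatten_ast(child))
    seq.append(f"{node.name}<right>")
    return seq

# Toy AST for the statement "return x":
tree = Node("return_stmt", [Node("return"), Node("expr", [Node("x")])])
print(flatten_ast(tree))
# ['return_stmt<left>', 'return', 'expr<left>', 'x', 'expr<right>', 'return_stmt<right>']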