UniXcoder: Unified Cross-Modal Pre-training for Code Representation

Cited by: 0
Authors
Guo, Daya [1 ,5 ]
Lu, Shuai [3 ]
Duan, Nan [3 ]
Wang, Yanlin [2 ]
Zhou, Ming [4 ]
Yin, Jian [1 ]
Affiliations
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangdong Key Lab Big Data Anal & Proc, Guangzhou, Peoples R China
[2] Sun Yat Sen Univ, Sch Software Engn, Guangzhou, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
[4] Langboat Technol, Beijing, Peoples R China
[5] Microsoft Res, Redmond, WA USA
Funding
National Natural Science Foundation of China
Keywords: N/A
DOI: N/A
CLC number: TP18 (Theory of Artificial Intelligence)
Subject classification codes: 081104; 0812; 0835; 1405
Abstract
Pre-trained models for programming languages have recently demonstrated great success in code intelligence. To support both code-related understanding and generation tasks, recent works attempt to pre-train unified encoder-decoder models. However, such an encoder-decoder framework is sub-optimal for auto-regressive tasks, especially code completion, which requires a decoder-only mode for efficient inference. In this paper, we present UniXcoder, a unified cross-modal pre-trained model for programming languages. The model utilizes mask attention matrices with prefix adapters to control its behavior and leverages cross-modal content such as ASTs and code comments to enhance code representation. To encode an AST, which is naturally a tree, in parallel, we propose a one-to-one mapping that transforms the AST into a sequence retaining all of the tree's structural information. Furthermore, we propose to learn code fragment representations from multi-modal content with contrastive learning, and then align representations across programming languages using a cross-modal generation task. We evaluate UniXcoder on five code-related tasks over nine datasets. To further evaluate the quality of code fragment representations, we also construct a dataset for a new task, zero-shot code-to-code search. Results show that our model achieves state-of-the-art performance on most tasks, and our analysis reveals that both comments and ASTs enhance UniXcoder.
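To make the mask-attention idea concrete, below is a minimal sketch of how an attention mask can switch a single shared Transformer between encoder-only, decoder-only, and encoder-decoder behavior. It assumes a 0/1 visibility convention where entry (i, j) = 1 means position i may attend to position j; the function name, mode strings, and source/target split are illustrative assumptions, not the paper's implementation.

```python
import torch

def build_attention_mask(mode: str, n: int, n_source: int = 0) -> torch.Tensor:
    """Illustrative n x n visibility mask (not UniXcoder's exact code):
    mask[i, j] = 1 means token i may attend to token j."""
    if mode == "encoder":
        # Understanding tasks: full bidirectional attention.
        return torch.ones(n, n)
    if mode == "decoder":
        # Auto-regressive tasks such as code completion: causal attention.
        return torch.tril(torch.ones(n, n))
    if mode == "encoder-decoder":
        # Source tokens attend bidirectionally among themselves; target
        # tokens attend causally and can additionally see all source tokens.
        mask = torch.tril(torch.ones(n, n))
        mask[:n_source, :n_source] = 1.0
        return mask
    raise ValueError(f"unknown mode: {mode!r}")

print(build_attention_mask("encoder-decoder", n=5, n_source=2))
```

Under this reading, the prefix adapters mentioned in the abstract signal the chosen mode to the model, while the mask enforces the corresponding visibility pattern.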
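The one-to-one AST-to-sequence mapping can be sketched in the same spirit: wrap every non-leaf subtree in paired left/right markers, so the flattened token sequence keeps all structural information and the original tree can be rebuilt from it. The Node class and marker spelling below are simplified assumptions, not the paper's exact serialization.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Node:
    label: str                    # non-terminal name, or the token itself at a leaf
    children: List["Node"] = field(default_factory=list)

def flatten(node: Node) -> List[str]:
    """Invertible tree-to-sequence mapping (illustrative): each non-leaf
    subtree is delimited by paired '<label:left>'/'<label:right>' markers."""
    if not node.children:         # leaf: emit its token directly
        return [node.label]
    out = [f"<{node.label}:left>"]
    for child in node.children:
        out.extend(flatten(child))
    out.append(f"<{node.label}:right>")
    return out

# Toy AST for the statement `return x`:
tree = Node("return_statement", [Node("return"), Node("x")])
print(flatten(tree))
# -> ['<return_statement:left>', 'return', 'x', '<return_statement:right>']
```

Because the markers are paired, the sequence can be parsed back into the tree with a stack, which is what makes the mapping one-to-one.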
Pages: 7212-7225
Page count: 14
Related Papers (50 in total; 10 shown)
  • [1] Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training
    Zeng, Yan
    Zhou, Wangchunshu
    Luo, Ao
    Cheng, Ziming
    Zhang, Xinsong
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023: 5731-5746
  • [2] COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation
    Wen, Keyu
    Xia, Jin
    Huang, Yuanyuan
    Li, Linyang
    Xu, Jiayan
    Shao, Jie
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 2188-2197
  • [3] CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations
    Li, Hang
    Ding, Wenbiao
    Kang, Yu
    Liu, Tianqiao
    Wu, Zhongqin
    Liu, Zitao
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021: 3966-3977
  • [4] Cross-modal Semantic Alignment Pre-training for Vision-and-Language Navigation
    Wu, Siying
    Fu, Xueyang
    Wu, Feng
    Zha, Zheng-Jun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022: 4233-4241
  • [5] Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation
    Jiang, Chaoya
    Ye, Wei
    Xu, Haiyang
    Huang, Songfang
    Huang, Fei
    Zhang, Shikun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023: 14660-14679
  • [6] Multi-Lingual Acquisition on Multimodal Pre-training for Cross-modal Retrieval
    Zhang, Liang
    Hu, Anwen
    Jin, Qin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [7] Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
    Song, Yuqing
    Chen, Shizhe
    Jin, Qin
    Luo, Wei
    Xie, Jun
    Huang, Fei
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 2843-2852
  • [8] VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
    Wang, Teng
    Jiang, Wenhao
    Lu, Zhichao
    Zheng, Feng
    Cheng, Ran
    Yin, Chengguo
    Luo, Ping
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [9] Contrastive Cross-Modal Pre-Training: A General Strategy for Small Sample Medical Imaging
    Liang, Gongbo
    Greenwell, Connor
    Zhang, Yu
    Xing, Xin
    Wang, Xiaoqin
    Kavuluru, Ramakanth
    Jacobs, Nathan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26(04): 1640-1649
  • [10] Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training
    Li, Gen
    Duan, Nan
    Fang, Yuejian
    Gong, Ming
    Jiang, Daxin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34: 11336-11344