Self-Supervised Learning of Smart Contract Representations

Cited by: 3

Authors:

Yang, Shouliang [1]

Gu, Xiaodong [1]

Shen, Beijun [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Sch Software, Shanghai, Peoples R China

Source:

30TH IEEE/ACM INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2022) | 2022

Funding:

National Natural Science Foundation of China;

Keywords:

Smart Contract; Self-supervised Learning; Code Representation Learning; Data Augmentation;

DOI:

10.1145/3524610.3527894

Chinese Library Classification:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Learning smart contract representations can greatly facilitate smart contract development in tasks such as bug detection and clone detection. Existing approaches for learning program representations are difficult to apply to smart contracts, which suffer from insufficient data and significant homogenization. To overcome these challenges, we propose SRCL, a novel self-supervised approach for learning smart contract representations. Unlike existing supervised methods, which are tied to task-specific data labels, SRCL leverages large-scale unlabeled data through self-supervised learning of both local and global information of smart contracts. It automatically extracts structural sequences from abstract syntax trees (ASTs). Two discriminators are then designed to guide the Transformer encoder to learn local and global semantic features of smart contracts. We evaluate SRCL on a dataset of 75,006 smart contracts collected from Etherscan. Experimental results show that SRCL considerably outperforms state-of-the-art code representation models on three downstream tasks.

Citation

Pages: 82-93

Page count: 12
