ASSBert: Active and semi-supervised bert for smart contract vulnerability detection

被引:11
|
作者
Sun, Xiaobing [1 ,2 ]
Tu, Liangqiong [1 ,2 ]
Zhang, Jiale [1 ,2 ]
Cai, Jie [1 ,2 ]
Li, Bin [1 ,2 ]
Wang, Yu [3 ]
机构
[1] Yangzhou Univ, Sch Informat Engn, Yangzhou 225127, Jiangsu, Peoples R China
[2] Jiangsu Engn Res Ctr Knowledge Management & Intell, Yangzhou 225127, Jiangsu, Peoples R China
[3] Guangzhou Univ, Inst Artificial Intelligence & Blockchain, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Smart contract; Vulnerability detection; Active learning; Semi-supervised learning;
D O I
10.1016/j.jisa.2023.103423
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the popularity of blockchain, the amount of smart contracts has increased very fast, and the safety of smart contracts has come to more extensive notice. Recently, machine learning technology has been widely applied in vulnerability detection for smart contracts. However, it implements effective smart contract vulnerability detection still faces a major challenge, that is, there is a problem of insufficient labeled data in the current field. Active learning can label data more efficiently. Nevertheless, classical active learning only uses limited labeled data for model training, contrary to the deep learning of a large amount of data required for model training. Because of the above, we provide a new framework, called ASSBert, that leverages active and semi-supervised bidirectional encoder representation from transformers network, which is dedicated to completing the task of smart contract vulnerability classification with a little amount of labeled code data and a large number of unlabeled code data. In our framework, active learning is responsible for selecting highly uncertain code data from unlabeled sol files and putting them into the training set after manual labeling. Besides, semi-supervised learning is charged to continuously pick a certain number of high-confidence unlabeled code data from unlabeled sol files, and put them into the training dataset behind pseudo-labeling. Intuitively, by combining active learning and semi-supervised learning, we are able to get more valuable data to increase the performance of our detection model. In our experiments, we collect our benchmark dataset included 6 vulnerabilities in about 20829 smart contracts. The result of the experiment demonstrates that our framework is superior to the baseline methods with a little amount of labeled code data and a large number of unlabeled code data.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Semi-Supervised Active Learning for Object Detection
    Chen, Sijin
    Yang, Yingyun
    Hua, Yan
    [J]. ELECTRONICS, 2023, 12 (02)
  • [2] Active Teacher for Semi-Supervised Object Detection
    Mi, Peng
    Lin, Jianghang
    Zhou, Yiyi
    Shen, Yunhang
    Luo, Gen
    Sun, Xiaoshuai
    Cao, Liujuan
    Fu, Rongrong
    Xu, Qiang
    Ji, Rongrong
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14462 - 14471
  • [3] Semi-supervised Active Salient Object Detection
    Lv, Yunqiu
    Liu, Bowen
    Zhang, Jing
    Dai, Yuchao
    Li, Aixuan
    Zhang, Tong
    [J]. PATTERN RECOGNITION, 2022, 123
  • [4] A Smart Contract Vulnerability Detection System Based on BERT Model and Fuzz Testing
    Liang, Zhehao
    Cui, Baojiang
    Wang, Dongbin
    Xu, Jie
    Liu, Huipeng
    [J]. INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS 2024, 2024, 214 : 288 - 295
  • [5] Smart Saint: an active semi-supervised learning internet filter
    Rigo, Felipe Vargas
    Maraes, Murillo Nicacio
    Matsubara, Edson Takashi
    [J]. 2013 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2013, : 136 - 142
  • [6] A Semi-supervised Vulnerability Management System
    Ghosh, Soumyadeep
    Bhaduri, Sourojit
    Kumar, Sanjay
    Verma, Janu
    Katyal, Yatin
    Saraswat, Ankur
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2023, 542 : 97 - 113
  • [7] Semi-Supervised Active Learning for Anomaly Detection in Aviation
    Memarzadeh, Milad
    Matthews, Bryan
    Templin, Thomas
    Sharif Rohani, Aida
    Weckler, Daniel
    [J]. JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2023, 20 (04): : 181 - 194
  • [8] Semi-supervised Active Learning for Video Action Detection
    Singh, Ayush
    Rana, Aayush J.
    Kumar, Akash
    Vyas, Shruti
    Rawat, Yogesh Singh
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4891 - 4899
  • [9] Active link selection for efficient semi-supervised community detection
    Yang, Liang
    Jin, Di
    Wang, Xiao
    Cao, Xiaochun
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [10] Semi-supervised Change Point Detection Using Active Learning
    De Brabandere, Arne
    Cao, Zhenxiang
    De Vos, Maarten
    Bertrand, Alexander
    Davis, Jesse
    [J]. DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 74 - 88