Robust Vulnerability Detection in Solidity-Based Ethereum Smart Contracts Using Fine-Tuned Transformer Encoder Models

Cited by: 0
Authors
Le, Thi-Thu-Huong [1, 2]
Kim, Jaehyun [2]
Lee, Sangmyeong [3]
Kim, Howon [3]
Affiliations
[1] Pusan Natl Univ, Blockchain Platform Res Ctr, Busan 609735, South Korea
[2] Pusan Natl Univ, IoT Res Ctr, Busan 609735, South Korea
[3] Pusan Natl Univ, Sch Comp Sci & Engn, Busan 609735, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Smart contracts; Codes; Transformers; Security; Solid modeling; Analytical models; Training; Encoding; Biological system modeling; Large language models; Ethereum smart contracts; large language models; multi-class imbalance; multi-class classification; smart contract vulnerability; solidity code;
DOI
10.1109/ACCESS.2024.3482389
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The rapid expansion of blockchain technology, particularly Ethereum, has driven widespread adoption of smart contracts. However, the security of these contracts remains a critical concern due to the increasing frequency and complexity of vulnerabilities. This paper presents a comprehensive approach to detecting vulnerabilities in Ethereum smart contracts using pre-trained Large Language Models (LLMs). We apply transformer-based LLMs, leveraging their ability to understand and analyze Solidity code to identify potential security flaws. Our methodology involves fine-tuning eight distinct pre-trained LLMs on curated datasets that vary in the types and distributions of vulnerabilities, including multi-class vulnerabilities. The datasets (SB Curate, Benchmark Solidity Smart Contract, and ScrawID) were selected to ensure a thorough evaluation of model performance across different vulnerability types. We employed over-sampling techniques to address class imbalances, resulting in more reliable training outcomes. We extensively evaluate these models using precision, recall, accuracy, F1 score, and Receiver Operating Characteristic (ROC) curve metrics. Our results demonstrate that the transformer encoder architecture, with its multi-head attention and feed-forward mechanisms, effectively captures the nuances of smart contract vulnerabilities. The models show promising potential in enhancing the security and reliability of Ethereum smart contracts, offering a robust solution to challenges posed by software vulnerabilities in the blockchain ecosystem.
Pages: 154700-154717
Page count: 18