Robust Vulnerability Detection in Solidity-Based Ethereum Smart Contracts Using Fine-Tuned Transformer Encoder Models

被引:0
|
作者
Le, Thi-Thu-Huong [1 ,2 ]
Kim, Jaehyun [2 ]
Lee, Sangmyeong [3 ]
Kim, Howon [3 ]
机构
[1] Pusan Natl Univ, Blockchain Platform Res Ctr, Busan 609735, South Korea
[2] Pusan Natl Univ, IoT Res Ctr, Busan 609735, South Korea
[3] Pusan Natl Univ, Sch Comp Sci & Engn, Busan 609735, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Smart contracts; Codes; Transformers; Security; Solid modeling; Analytical models; Training; Encoding; Biological system modeling; Large language models; Ethereum smart contracts; large language models; multi-class imbalance; multi-class classification; smart contract vulnerability; solidity code;
D O I
10.1109/ACCESS.2024.3482389
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid expansion of blockchain technology, particularly Ethereum, has driven widespread adoption of smart contracts. However, the security of these contracts remains a critical concern due to the increasing frequency and complexity of vulnerabilities. This paper presents a comprehensive approach to detecting vulnerabilities in Ethereum smart contracts using pre-trained Large Language Models (LLMs). We apply transformer-based LLMs, leveraging their ability to understand and analyze Solidity code to identify potential security flaws. Our methodology involves fine-tuning eight distinct pre-trained LLM models on curated datasets varying in types and distributions of vulnerabilities, including multi-class vulnerabilities. The datasets-SB Curate, Benmark Solidity Smart Contract, and ScrawID-were selected to ensure a thorough evaluation of model performance across different vulnerability types. We employed over-sampling techniques to address class imbalances, resulting in more reliable training outcomes. We extensively evaluate these models using precision, recall, accuracy, F1 score, and Receiver Operating Characteristics (ROC) curve metrics. Our results demonstrate that the transformer encoder architecture, with its multi-head attention and feed-forward mechanisms, effectively captures the nuances of smart contract vulnerabilities. The models show promising potential in enhancing the security and reliability of Ethereum smart contracts, offering a robust solution to challenges posed by software vulnerabilities in the blockchain ecosystem.
引用
收藏
页码:154700 / 154717
页数:18
相关论文
共 39 条
  • [21] Sóley: Automated detection of logic vulnerabilities in Ethereum smart contracts using large language models☆
    Soud, Majd
    Nuutinen, Waltteri
    Liebel, Grischa
    JOURNAL OF SYSTEMS AND SOFTWARE, 2025, 226
  • [22] Smart contracts auditing and multi-classification using machine learning algorithms: an efficient vulnerability detection in ethereum blockchain
    El Haddouti, Samia
    Khaldoune, Mohammed
    Ayache, Meryeme
    Ech-Cherif El Kettani, Mohamed Dafir
    COMPUTING, 2024, 106 (09) : 2971 - 3003
  • [23] Question-answering framework for building codes using fine-tuned and distilled pre-trained transformer models
    Xue, Xiaorui
    Zhang, Jiansong
    Chen, Yunfeng
    AUTOMATION IN CONSTRUCTION, 2024, 168
  • [24] A deep dive into automated sexism detection using fine-tuned deep learning and large language models
    Vetagiri, Advaitha
    Pakray, Partha
    Das, Amitava
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 145
  • [25] Multiclass vulnerability and clone detection in Ethereum smart contracts using Block-wise Abstract Syntax Tree based Federated Graph Neural Networks
    Sharma, Shruti
    Ratmele, Ankur
    Seth, Abhay Deep
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
  • [26] Fire and Smoke Detection Using Fine-Tuned YOLOv8 and YOLOv7 Deep Models
    Chetoui, Mohamed
    Akhloufi, Moulay A.
    FIRE-SWITZERLAND, 2024, 7 (04):
  • [27] Fine-tuned LSTM-Based Model for Efficient Honeypot-Based Network Intrusion Detection System in Smart Grid Networks
    Albaseer, Abdullatif
    Abdallah, Mohamed
    2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [28] A fine-tuned vision transformer based enhanced multi-class brain tumor classification using MRI scan imagery
    Reddy, C. Kishor Kumar
    Reddy, Pulakurthi Anaghaa
    Janapati, Himaja
    Assiri, Basem
    Shuaib, Mohammed
    Alam, Shadab
    Sheneamer, Abdullah
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [29] Visualized Malware Multi-Classification Framework Using Fine-Tuned CNN-Based Transfer Learning Models
    El-Shafai, Walid
    Almomani, Iman
    AlKhayer, Aala
    APPLIED SCIENCES-BASEL, 2021, 11 (14):
  • [30] Aspect-based sentiment analysis for software requirements elicitation using fine-tuned Bidirectional Encoder Representations from Transformers and Explainable Artificial Intelligence
    Taj, Soonh
    Daudpota, Sher Muhammad
    Imran, Ali Shariq
    Kastrati, Zenun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 151