Robust Vulnerability Detection in Solidity-Based Ethereum Smart Contracts Using Fine-Tuned Transformer Encoder Models

被引：0

作者：

Le, Thi-Thu-Huong ^{[1
,2
]}

Kim, Jaehyun ^{[2
]}

Lee, Sangmyeong ^{[3
]}

Kim, Howon ^{[3
]}

机构：

[1] Pusan Natl Univ, Blockchain Platform Res Ctr, Busan 609735, South Korea

[2] Pusan Natl Univ, IoT Res Ctr, Busan 609735, South Korea

[3] Pusan Natl Univ, Sch Comp Sci & Engn, Busan 609735, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Smart contracts; Codes; Transformers; Security; Solid modeling; Analytical models; Training; Encoding; Biological system modeling; Large language models; Ethereum smart contracts; large language models; multi-class imbalance; multi-class classification; smart contract vulnerability; solidity code;

D O I：

10.1109/ACCESS.2024.3482389

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The rapid expansion of blockchain technology, particularly Ethereum, has driven widespread adoption of smart contracts. However, the security of these contracts remains a critical concern due to the increasing frequency and complexity of vulnerabilities. This paper presents a comprehensive approach to detecting vulnerabilities in Ethereum smart contracts using pre-trained Large Language Models (LLMs). We apply transformer-based LLMs, leveraging their ability to understand and analyze Solidity code to identify potential security flaws. Our methodology involves fine-tuning eight distinct pre-trained LLM models on curated datasets varying in types and distributions of vulnerabilities, including multi-class vulnerabilities. The datasets-SB Curate, Benmark Solidity Smart Contract, and ScrawID-were selected to ensure a thorough evaluation of model performance across different vulnerability types. We employed over-sampling techniques to address class imbalances, resulting in more reliable training outcomes. We extensively evaluate these models using precision, recall, accuracy, F1 score, and Receiver Operating Characteristics (ROC) curve metrics. Our results demonstrate that the transformer encoder architecture, with its multi-head attention and feed-forward mechanisms, effectively captures the nuances of smart contract vulnerabilities. The models show promising potential in enhancing the security and reliability of Ethereum smart contracts, offering a robust solution to challenges posed by software vulnerabilities in the blockchain ecosystem.

引用

页码：154700 / 154717

页数：18

共 39 条

[21] Sóley: Automated detection of logic vulnerabilities in Ethereum smart contracts using large language models☆
Soud, Majd
Nuutinen, Waltteri
Liebel, Grischa
JOURNAL OF SYSTEMS AND SOFTWARE, 2025, 226
[22] Smart contracts auditing and multi-classification using machine learning algorithms: an efficient vulnerability detection in ethereum blockchain
El Haddouti, Samia
Khaldoune, Mohammed
Ayache, Meryeme
Ech-Cherif El Kettani, Mohamed Dafir
COMPUTING, 2024, 106 (09) : 2971 - 3003
[23] Question-answering framework for building codes using fine-tuned and distilled pre-trained transformer models
Xue, Xiaorui
Zhang, Jiansong
Chen, Yunfeng
AUTOMATION IN CONSTRUCTION, 2024, 168
[24] A deep dive into automated sexism detection using fine-tuned deep learning and large language models
Vetagiri, Advaitha
Pakray, Partha
Das, Amitava
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 145
[25] Multiclass vulnerability and clone detection in Ethereum smart contracts using Block-wise Abstract Syntax Tree based Federated Graph Neural Networks
Sharma, Shruti
Ratmele, Ankur
Seth, Abhay Deep
COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
[26] Fire and Smoke Detection Using Fine-Tuned YOLOv8 and YOLOv7 Deep Models
Chetoui, Mohamed
Akhloufi, Moulay A.
FIRE-SWITZERLAND, 2024, 7 (04):
[27] Fine-tuned LSTM-Based Model for Efficient Honeypot-Based Network Intrusion Detection System in Smart Grid Networks
Albaseer, Abdullatif
Abdallah, Mohamed
2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
[28] A fine-tuned vision transformer based enhanced multi-class brain tumor classification using MRI scan imagery
Reddy, C. Kishor Kumar
Reddy, Pulakurthi Anaghaa
Janapati, Himaja
Assiri, Basem
Shuaib, Mohammed
Alam, Shadab
Sheneamer, Abdullah
FRONTIERS IN ONCOLOGY, 2024, 14
[29] Visualized Malware Multi-Classification Framework Using Fine-Tuned CNN-Based Transfer Learning Models
El-Shafai, Walid
Almomani, Iman
AlKhayer, Aala
APPLIED SCIENCES-BASEL, 2021, 11 (14):
[30] Aspect-based sentiment analysis for software requirements elicitation using fine-tuned Bidirectional Encoder Representations from Transformers and Explainable Artificial Intelligence
Taj, Soonh
Daudpota, Sher Muhammad
Imran, Ali Shariq
Kastrati, Zenun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 151

← 1 2 3 4 →