NtNDet: Hardware Trojan detection based on pre-trained language models

Cited by: 0
Authors
Kuang, Shijie [1 ]
Quan, Zhe [1 ]
Xie, Guoqi [1 ]
Cai, Xiaomin [2 ,3 ]
Li, Keqin [4 ]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Hunan Univ Finance & Econ, Sch Comp Sci & Technol, Changsha, Peoples R China
[3] Acad Mil Sci, Beijing, Peoples R China
[4] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
Keywords
Gate-level netlists; Hardware Trojan detection; Large language model; Netlist-to-natural-language; Transfer learning;
DOI
10.1016/j.eswa.2025.126666
CLC number
TP18 [Theory of artificial intelligence];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hardware Trojans (HTs) are malicious modifications embedded in Integrated Circuits (ICs) that pose a significant threat to security. The concealment of HTs and the complexity of IC manufacturing make them difficult to detect. An effective solution is to identify HTs at the gate level using machine learning techniques. However, current methods rely primarily on end-to-end training, which fails to exploit the advantages of large-scale pre-trained models and transfer learning, and they do not leverage the extensive background knowledge available in massive datasets. This study proposes NtNDet, an HT detection approach based on large-scale pre-trained NLP models. NtNDet includes a method called Netlist-to-Natural-Language (NtN) that converts gate-level netlists into a natural-language format suitable for Natural Language Processing (NLP) models, and it applies the Transformer's self-attention mechanism to model complex dependencies within the netlist. This is the first application of large-scale pre-trained models to gate-level netlist HT detection, promoting the use of pre-trained models in the security field. Experiments on the Trust-Hub, TRIT-TC, and TRIT-TS benchmarks demonstrate that our approach outperforms existing HT detection methods: precision increases by at least 5.27%, the True Positive Rate (TPR) by 3.06%, the True Negative Rate (TNR) by 0.01%, and the F1 score by about 3.17%, setting a new state of the art in HT detection.
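The abstract describes two ideas: converting a gate-level netlist into natural-language-like sentences (NtN) and classifying them with a pre-trained Transformer. A minimal sketch of such a pipeline is shown below, assuming the Hugging Face transformers library; the sentence template, the model choice (bert-base-uncased), and the toy netlist are illustrative assumptions, not the paper's actual NtN encoding or fine-tuned model.

```python
# Minimal sketch (not the authors' implementation): turn gate instances into
# sentences and score them with a pre-trained Transformer classifier.
# Assumes the Hugging Face `transformers` and `torch` packages are installed.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

def netlist_to_sentence(gate_type, output, inputs):
    """Hypothetical NtN-style conversion: one gate instance -> one sentence."""
    return f"{gate_type} gate drives {output} from inputs {', '.join(inputs)}."

# Toy netlist fragment: (gate_type, output_net, input_nets)
netlist = [
    ("AND", "n1", ["a", "b"]),
    ("XOR", "trigger", ["n1", "rare_net"]),  # rare-net logic, possibly Trojan-related
]
sentences = [netlist_to_sentence(*gate) for gate in netlist]

# Pre-trained encoder with a 2-way classification head (0 = benign, 1 = Trojan);
# in practice the head would be fine-tuned on labeled netlist sentences.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    logits = model(**batch).logits
probs = torch.softmax(logits, dim=-1)

for sentence, p in zip(sentences, probs):
    print(f"{sentence}  -> Trojan probability {p[1]:.3f}")
```

The design point the abstract emphasizes is that the heavy lifting (the encoder's self-attention over the sentence tokens) comes from a model pre-trained on large text corpora, so only the lightweight classification head needs task-specific training via transfer learning.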
Pages: 13
Related papers
50 records in total
  • [21] Jailbreaking Pre-trained Large Language Models Towards Hardware Vulnerability Insertion Ability
    Wan, Gwok-Waa
    Wong, Sam-Zaak
    Wang, Xi
    PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 579 - 582
  • [22] A Study of Pre-trained Language Models in Natural Language Processing
    Duan, Jiajia
    Zhao, Hui
    Zhou, Qian
    Qiu, Meikang
    Liu, Meiqin
    2020 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2020), 2020, : 116 - 121
  • [23] Entity Resolution Based on Pre-trained Language Models with Two Attentions
    Zhu, Liang
    Liu, Hao
    Song, Xin
    Wei, Yonggang
    Wang, Yu
    WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 433 - 448
  • [24] Intelligent Completion of Ancient Texts Based on Pre-trained Language Models
    Li J.
    Ming C.
    Guo Z.
    Qian T.
    Peng Z.
    Wang X.
    Li X.
    Li J.
    Data Analysis and Knowledge Discovery, 2024, 8 (05) : 59 - 67
  • [25] A Brief Review of Relation Extraction Based on Pre-Trained Language Models
    Xu, Tiange
    Zhang, Fu
    FUZZY SYSTEMS AND DATA MINING VI, 2020, 331 : 775 - 789
  • [26] Discrimination Bias Detection Through Categorical Association in Pre-Trained Language Models
    Dusi, Michele
    Arici, Nicola
    Gerevini, Alfonso Emilio
    Putelli, Luca
    Serina, Ivan
    IEEE ACCESS, 2024, 12 : 162651 - 162667
  • [27] Pre-trained Trojan Attacks for Visual Recognition
    Liu, Aishan
    Liu, Xianglong
    Zhang, Xinwei
    Xiao, Yisong
    Zhou, Yuguang
    Liang, Siyuan
    Wang, Jiakai
    Cao, Xiaochun
    Tao, Dacheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [28] From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Models to Pre-trained Machine Reader
    Xu, Weiwen
    Li, Xin
    Zhang, Wenxuan
    Zhou, Meng
    Lam, Wai
    Si, Luo
    Bing, Lidong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [29] Pre-trained models for natural language processing: A survey
    Qiu XiPeng
    Sun TianXiang
    Xu YiGe
    Shao YunFan
    Dai Ning
    Huang XuanJing
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (10) : 1872 - 1897
  • [30] Probing Pre-Trained Language Models for Disease Knowledge
    Alghanmi, Israa
    Espinosa-Anke, Luis
    Schockaert, Steven
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3023 - 3033