Multi-Head Attention Based Malware Detection with Byte-Level Representation

被引：0

作者：

Thai Vu Nguyen ^{[1
]}

Hoang, Duc N. M. ^{[1
]}

Long Bao Le ^{[1
]}

机构：

[1] Univ Quebec, INRS, Montreal, PQ, Canada

来源：

2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年

关键词：

Malware detection; byte-level representation; attention mechanism;

D O I：

10.1109/WCNC57260.2024.10571063

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Machine learning (ML)-based malware detection plays a crucial role in cyber-security by enabling the identification of potential malware threats without relying solely on predefined signatures or rules. Conventional ML approaches require a feature engineering step to analyze and convert collected data (e.g., captured network traffic and malware programs) into a format suitable for model training and prediction. However, this particular step typically requires a considerable depth of domain-specific expertise and also adds additional complexity to the learning process. To mitigate this limitation, we propose to perform malware detection directly based on the byte-level representation of malware data. We employ a byte embedding layer to convert byte sequences into higher-dimension representations. Then, we employ the multi-head attention technique to capture their correlation before forwarding the output to a fully connected deep neural network for malware detection. Extensive experiments on multiple datasets with diverse file formats demonstrated the superior performance of our proposed method. Additionally, we performed an ablation study on the role of the byte-embedding layer to show that our approach does not depend on a high embedding dimension for strong predictive performance, which helps reduce training complexity.

引用

页数：6

共 50 条

[1] Learning Latent Byte-Level Feature Representation for Malware Detection
Yousefi-Azar, Mahmood
Hamey, Len
Varadharajan, Vijay
Chen, Shiping
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 568 - 578
[2] Byte-Level Function-Associated Method for Malware Detection
Hao, Jingwei
Luo, Senlin
Pan, Limin
[J]. Computer Systems Science and Engineering, 2023, 46 (01): : 719 - 734
[3] Byte-level malware classification based on markov images and deep learning
Yuan, Baoguo
Wang, Junfeng
Liu, Dong
Guo, Wen
Wu, Peng
Bao, Xuhua
[J]. COMPUTERS & SECURITY, 2020, 92
[4] Webshell detection with byte-level features based on deep learning
Xiao Zhongzheng
Luktarhan, Nurbol
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (01) : 1585 - 1596
[5] A Novel Source Code Representation Approach Based on Multi-Head Attention
Xiao, Lei
Zhong, Hao
Liu, Jianjian
Zhang, Kaiyu
Xu, Qizhen
Chang, Le
[J]. ELECTRONICS, 2024, 13 (11)
[6] Combining Multi-Head Attention and Sparse Multi-Head Attention Networks for Session-Based Recommendation
Zhao, Zhiwei
Wang, Xiaoye
Xiao, Yingyuan
[J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[7] Sarcasm Detection Using Multi-Head Attention Based Bidirectional LSTM
Kumar, Avinash
Narapareddy, Vishnu Teja
Aditya Srikanth, Veerubhotla
Malapati, Aruna
Neti, Lalita Bhanu Murthy
[J]. IEEE ACCESS, 2020, 8 : 6388 - 6397
[8] Epilepsy detection based on multi-head self-attention mechanism
Ru, Yandong
An, Gaoyang
Wei, Zheng
Chen, Hongming
[J]. PLOS ONE, 2024, 19 (06):
[9] Duplicate Question Detection based on Neural Networks and Multi-head Attention
Zhang, Heng
Chen, Liangyu
[J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 13 - 18
[10] Android malware detection based on multi-head squeeze-and-excitation residual network
Zhu, Hui-juan
Gu, Wei
Wang, Liang-min
Xu, Zhi-cheng
Sheng, Victor S.
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212

← 1 2 3 4 5 →