Multi-Head Attention Based Malware Detection with Byte-Level Representation

被引:0
|
作者
Thai Vu Nguyen [1 ]
Hoang, Duc N. M. [1 ]
Long Bao Le [1 ]
机构
[1] Univ Quebec, INRS, Montreal, PQ, Canada
关键词
Malware detection; byte-level representation; attention mechanism;
D O I
10.1109/WCNC57260.2024.10571063
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML)-based malware detection plays a crucial role in cyber-security by enabling the identification of potential malware threats without relying solely on predefined signatures or rules. Conventional ML approaches require a feature engineering step to analyze and convert collected data (e.g., captured network traffic and malware programs) into a format suitable for model training and prediction. However, this particular step typically requires a considerable depth of domain-specific expertise and also adds additional complexity to the learning process. To mitigate this limitation, we propose to perform malware detection directly based on the byte-level representation of malware data. We employ a byte embedding layer to convert byte sequences into higher-dimension representations. Then, we employ the multi-head attention technique to capture their correlation before forwarding the output to a fully connected deep neural network for malware detection. Extensive experiments on multiple datasets with diverse file formats demonstrated the superior performance of our proposed method. Additionally, we performed an ablation study on the role of the byte-embedding layer to show that our approach does not depend on a high embedding dimension for strong predictive performance, which helps reduce training complexity.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Learning Latent Byte-Level Feature Representation for Malware Detection
    Yousefi-Azar, Mahmood
    Hamey, Len
    Varadharajan, Vijay
    Chen, Shiping
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 568 - 578
  • [2] Byte-Level Function-Associated Method for Malware Detection
    Hao, Jingwei
    Luo, Senlin
    Pan, Limin
    [J]. Computer Systems Science and Engineering, 2023, 46 (01): : 719 - 734
  • [3] Byte-level malware classification based on markov images and deep learning
    Yuan, Baoguo
    Wang, Junfeng
    Liu, Dong
    Guo, Wen
    Wu, Peng
    Bao, Xuhua
    [J]. COMPUTERS & SECURITY, 2020, 92
  • [4] Webshell detection with byte-level features based on deep learning
    Xiao Zhongzheng
    Luktarhan, Nurbol
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (01) : 1585 - 1596
  • [5] A Novel Source Code Representation Approach Based on Multi-Head Attention
    Xiao, Lei
    Zhong, Hao
    Liu, Jianjian
    Zhang, Kaiyu
    Xu, Qizhen
    Chang, Le
    [J]. ELECTRONICS, 2024, 13 (11)
  • [6] Combining Multi-Head Attention and Sparse Multi-Head Attention Networks for Session-Based Recommendation
    Zhao, Zhiwei
    Wang, Xiaoye
    Xiao, Yingyuan
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [7] Sarcasm Detection Using Multi-Head Attention Based Bidirectional LSTM
    Kumar, Avinash
    Narapareddy, Vishnu Teja
    Aditya Srikanth, Veerubhotla
    Malapati, Aruna
    Neti, Lalita Bhanu Murthy
    [J]. IEEE ACCESS, 2020, 8 : 6388 - 6397
  • [8] Epilepsy detection based on multi-head self-attention mechanism
    Ru, Yandong
    An, Gaoyang
    Wei, Zheng
    Chen, Hongming
    [J]. PLOS ONE, 2024, 19 (06):
  • [9] Duplicate Question Detection based on Neural Networks and Multi-head Attention
    Zhang, Heng
    Chen, Liangyu
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 13 - 18
  • [10] Android malware detection based on multi-head squeeze-and-excitation residual network
    Zhu, Hui-juan
    Gu, Wei
    Wang, Liang-min
    Xu, Zhi-cheng
    Sheng, Victor S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212