A Novel Image-Based Malware Classification Model Using Deep Learning

Cited by: 5
Authors
Jiang, Yongkang [1]
Li, Shenghong [1]
Wu, Yue [1]
Zou, Futai [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
Keywords
Malware; Embedding; Classification; Deep learning;
DOI
10.1007/978-3-030-36711-4_14
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The vast volume of data that must be evaluated as potentially malicious is becoming one of the major challenges for antivirus products. In this paper, we propose a novel image-based malware classification model using deep learning to address large-scale malware analysis. The model includes a malware embedding method called YongImage, which maps instruction-level information and disassembly metadata generated by the IDA disassembler into an image vector, and a deep neural network named malVecNet, which has a simpler structure and a faster convergence rate. YongImage converts malware analysis tasks into image classification problems, which do not rely on domain knowledge or complex feature extraction. We also draw on the idea of sentence-level classification from Natural Language Processing to build and optimize malVecNet. Compared to previous work, malVecNet has better theoretical interpretability and can be trained more effectively. We evaluate our model with 10-fold cross-validation on the Microsoft malware classification challenge dataset. The results demonstrate that our model achieves 99.49% accuracy with a log loss of 0.022. Although our scheme is less precise than the challenge winner's, it delivers an orders-of-magnitude performance boost. Our model also outperforms most other related work.
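The abstract reports two evaluation metrics, accuracy and log loss. As a minimal sketch, assuming the standard multi-class definition of log loss (the mean negative log-probability assigned to the true class), the two metrics can be computed as follows. The function names and the toy predictions are hypothetical and are not taken from the paper.

```python
import math


def multiclass_log_loss(y_true, probs, eps=1e-15):
    """Mean negative log-probability assigned to the true class."""
    total = 0.0
    for label, row in zip(y_true, probs):
        # Clip the probability so that log(0) can never occur.
        p = min(max(row[label], eps), 1.0 - eps)
        total += -math.log(p)
    return total / len(y_true)


def accuracy(y_true, probs):
    """Fraction of samples whose most probable class equals the true class."""
    hits = 0
    for label, row in zip(y_true, probs):
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            hits += 1
    return hits / len(y_true)


# Toy data (illustrative only): 4 samples, 3 classes.
y_true = [0, 1, 2, 1]
probs = [
    [0.90, 0.05, 0.05],
    [0.10, 0.80, 0.10],
    [0.20, 0.20, 0.60],
    [0.50, 0.30, 0.20],  # misclassified: true class 1 only gets 0.30
]

print("accuracy:", accuracy(y_true, probs))
print("log loss:", multiclass_log_loss(y_true, probs))
```

Log loss penalizes confident wrong predictions more heavily than accuracy does, which is why the two figures are usually reported together. Under 10-fold cross-validation, each metric would typically be computed on every held-out fold and then averaged.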
Pages: 150-161
Page count: 12