A multi-view feature fusion approach for effective malware classification using Deep Learning

被引:14
|
作者
Chaganti, Rajasekhar [1 ]
Ravi, Vinayakumar [2 ]
Pham, Tuan D. [2 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
[2] Prince Mohammad Bin Fahd Univ, Ctr Artificial Intelligence, Khobar, Saudi Arabia
关键词
Cybersecurity; Cybercrime; Malware analysis; Portable Executable; Multi-view; Feature fusion; Machine learning; Deep Learning; Convolution Neural Network;
D O I
10.1016/j.jisa.2022.103402
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The number of malware infected machines from all over the world has been growing day by day. New malware variants appear in the wild to evade the malware detection and classification systems and may infect with ransomware or crypto miners for adversary financial gain. A recent colonial pipeline ransomware attack is an example of these attacks that impacted daily human activities, and the victim had to pay ransom to restore their operations. Windows-based systems are the most adopted systems across different industries for running applications. They are prone to get targeted by installing the malware. In this paper, we propose a Deep Learning (DL)-based Convolutional Neural Network (CNN) model to perform the malware classification on Portable Executable (PE) binary files using the fusion feature set approach. We present an extensive performance evaluation of various DL model architecture and Machine Learning (ML) classifier i.e. Support Vector Machine (SVM), on multi-aspect feature sets covering the static, dynamic, and image features to select the proposed CNN model. We further leverage the CNN-based architecture for effective classification of the malware using different combinations of feature sets and compare the results with the best-performed individual feature set. Our performance evaluation of the proposed model shows that the model classifies the malware or benign files with an accuracy of 97% when using fusion feature sets. The proposed model is robust and generalizable and showed similar performances on completely unseen two malware datasets. In addition, the embedding features of the CNN model are visualized, and various visualization methods are employed to understand the characteristics of the datasets. Further, large-scale learning and stacked classifiers were employed after the penultimate layer to enhance the CNN classification performance.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Fusion of Deep Learning Models for Multi-View Image Classification
    Maguire, Brian
    Seminerio, Eleanor
    [J]. SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXII, 2023, 12547
  • [2] Multi-view Feature Fusion for Activity Classification
    Hekmat, Mitra
    Mousavi, Zahra
    Aghajan, Hamid
    [J]. ICDSC 2016: 10TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERA, 2016, : 190 - 195
  • [3] Multi-view SAS Image Classification Using Deep Learning
    Williams, David P.
    Dugelay, Samantha
    [J]. OCEANS 2016 MTS/IEEE MONTEREY, 2016,
  • [4] A Multi-View Deep Evidential Learning Approach for Mammogram Density Classification
    Gudhe, Naga Raju
    Mazen, Sudah
    Sund, Reijo
    Kosma, Veli-Matti
    Behravan, Hamid
    Mannermaa, Arto
    [J]. IEEE ACCESS, 2024, 12 : 67889 - 67909
  • [5] Multi-view Fusion with Deep Learning for 3D Shape Classification
    Huang, Xiang
    Wang, Mantao
    Zhang, Dejun
    Zhu, Yu
    Zou, Lu
    Sun, Jun
    Han, Fei
    He, Linchao
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 189 - 194
  • [6] Ensemble multi-view feature set partitioning method for effective multi-view learning
    Singh, Ritika
    Kumar, Vipin
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (08) : 4957 - 5001
  • [7] Generative multi-view and multi-feature learning for classification
    Li, Jinxing
    Zhang, Bob
    Lu, Guangming
    Zhang, David
    [J]. INFORMATION FUSION, 2019, 45 : 215 - 226
  • [8] Learning from Context: A Multi-View Deep Learning Architecture for Malware Detection
    Kyadige, Adarsh
    Rudd, Ethan M.
    Berlin, Konstantin
    [J]. 2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 1 - 7
  • [9] Deep Learning for Multi-View Ultrasonic Image Fusion
    Pilikos, Georgios
    Horchens, Lars
    Van Leeuwen, Tristan
    Lucka, Felix
    [J]. INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS 2021), 2021,
  • [10] A Multi-view Feature Decomposition Deep Learning Method for Lung Cancer Histology Classification
    Gao, Heng
    Wang, Minghui
    Li, Haichun
    Liu, Zhaodi
    Liang, Wei
    Li, Ao
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705