A multi-view feature fusion approach for effective malware classification using Deep Learning

被引：14

作者：

Chaganti, Rajasekhar ^{[1
]}

Ravi, Vinayakumar ^{[2
]}

Pham, Tuan D. ^{[2
]}

机构：

[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA

[2] Prince Mohammad Bin Fahd Univ, Ctr Artificial Intelligence, Khobar, Saudi Arabia

来源：

JOURNAL OF INFORMATION SECURITY AND APPLICATIONS | 2023年 / 72卷

关键词：

Cybersecurity; Cybercrime; Malware analysis; Portable Executable; Multi-view; Feature fusion; Machine learning; Deep Learning; Convolution Neural Network;

D O I：

10.1016/j.jisa.2022.103402

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The number of malware infected machines from all over the world has been growing day by day. New malware variants appear in the wild to evade the malware detection and classification systems and may infect with ransomware or crypto miners for adversary financial gain. A recent colonial pipeline ransomware attack is an example of these attacks that impacted daily human activities, and the victim had to pay ransom to restore their operations. Windows-based systems are the most adopted systems across different industries for running applications. They are prone to get targeted by installing the malware. In this paper, we propose a Deep Learning (DL)-based Convolutional Neural Network (CNN) model to perform the malware classification on Portable Executable (PE) binary files using the fusion feature set approach. We present an extensive performance evaluation of various DL model architecture and Machine Learning (ML) classifier i.e. Support Vector Machine (SVM), on multi-aspect feature sets covering the static, dynamic, and image features to select the proposed CNN model. We further leverage the CNN-based architecture for effective classification of the malware using different combinations of feature sets and compare the results with the best-performed individual feature set. Our performance evaluation of the proposed model shows that the model classifies the malware or benign files with an accuracy of 97% when using fusion feature sets. The proposed model is robust and generalizable and showed similar performances on completely unseen two malware datasets. In addition, the embedding features of the CNN model are visualized, and various visualization methods are employed to understand the characteristics of the datasets. Further, large-scale learning and stacked classifiers were employed after the penultimate layer to enhance the CNN classification performance.

引用

页数：15

共 50 条

[1] Fusion of Deep Learning Models for Multi-View Image Classification
Maguire, Brian
Seminerio, Eleanor
[J]. SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXII, 2023, 12547
[2] Multi-view Feature Fusion for Activity Classification
Hekmat, Mitra
Mousavi, Zahra
Aghajan, Hamid
[J]. ICDSC 2016: 10TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERA, 2016, : 190 - 195
[3] Multi-view SAS Image Classification Using Deep Learning
Williams, David P.
Dugelay, Samantha
[J]. OCEANS 2016 MTS/IEEE MONTEREY, 2016,
[4] A Multi-View Deep Evidential Learning Approach for Mammogram Density Classification
Gudhe, Naga Raju
Mazen, Sudah
Sund, Reijo
Kosma, Veli-Matti
Behravan, Hamid
Mannermaa, Arto
[J]. IEEE ACCESS, 2024, 12 : 67889 - 67909
[5] Multi-view Fusion with Deep Learning for 3D Shape Classification
Huang, Xiang
Wang, Mantao
Zhang, Dejun
Zhu, Yu
Zou, Lu
Sun, Jun
Han, Fei
He, Linchao
[J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 189 - 194
[6] Ensemble multi-view feature set partitioning method for effective multi-view learning
Singh, Ritika
Kumar, Vipin
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (08) : 4957 - 5001
[7] Generative multi-view and multi-feature learning for classification
Li, Jinxing
Zhang, Bob
Lu, Guangming
Zhang, David
[J]. INFORMATION FUSION, 2019, 45 : 215 - 226
[8] Learning from Context: A Multi-View Deep Learning Architecture for Malware Detection
Kyadige, Adarsh
Rudd, Ethan M.
Berlin, Konstantin
[J]. 2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 1 - 7
[9] Deep Learning for Multi-View Ultrasonic Image Fusion
Pilikos, Georgios
Horchens, Lars
Van Leeuwen, Tristan
Lucka, Felix
[J]. INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS 2021), 2021,
[10] A Multi-view Feature Decomposition Deep Learning Method for Lung Cancer Histology Classification
Gao, Heng
Wang, Minghui
Li, Haichun
Liu, Zhaodi
Liang, Wei
Li, Ao
[J]. FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705

← 1 2 3 4 5 →