An ensemble approach for imbalanced multiclass malware classification using 1D-CNN

被引：0

作者：

Panda B. ^{[1
]}

Bisoyi S.S. ^{[2
]}

Panigrahy S. ^{[3
]}

机构：

[1] Department of Computer Science and Engineering, Institute of Technical Education and Research, Siksha ’O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar

[2] Department of Computer Science and Information Technology, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar

[3] Haas School of Business, University of California, Berkeley, Berkeley, CA

来源：

PeerJ Computer Science | 2023年 / 9卷

关键词：

1D-CNN; API sequence; Dynamic analysis; Ensemble learning; Malware classification; Skip-gram;

D O I：

10.7717/PEERJ-CS.1677

中图分类号：

学科分类号：

摘要：

Dependence on the internet and computer programs demonstrates the significance of computer programs in our day-to-day lives. Such demands motivate malware developers to create more malware, both in terms of quantity and variety. Researchers are constantly faced with hurdles while attempting to protect themselves from potential hazards and risks due to malware authors’ usage of code obfuscation techniques. Metamorphic and polymorphic variations are easily able to elude the widely utilized signature-based detection procedures. Researchers are more interested in deep learning approaches than machine learning techniques to analyze the behavior of such a vast number of virus variants. Researchers have been drawn to the categorization of malware within itself in addition to the classification of malware against benign programs to examine the behavioral differences between them. In order to investigate the relationship between the application programming interface (API) calls throughout API sequences and classify them, this work uses the one-dimensional convolutional neural network (1D-CNN) model to solve a multiclass classification problem. On API sequences, feature vectors for distinctive APIs are created using the Word2Vec word embedding approach and the skip-gram model. The one-vs.-rest approach is used to train 1D-CNN models to categorize malware, and all of them are then combined with a suggested ModifiedSoftVoting algorithm to improve classification. On the open benchmark dataset Mal-API-2019, the suggested ensembled 1D-CNN architecture captures improved evaluation scores with an accuracy of 0.90, a weighted average F1-score of 0.90, and an AUC score of more than 0.96 for all classes of malware. Subjects Data Mining and Machine Learning, Security and Privacy, Neural Networks © 2023 Panda et al. Distributed under Creative Commons CC-BY 4.0. All Rights Reserved.

引用

共 50 条

[21] Snoring Sound Classification Using 1D-CNN Model Based on Multi-Feature Extraction
Adesuyi, TosinAkinwale
Kim, Byeong-Man
Kim, Jongwan
INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2022, 22 (01) : 1 - 10
[22] Advanced deep learning framework for ECG arrhythmia classification using 1D-CNN with attention mechanism
Guhdar, Mohammed
Mohammed, Abdulhakeem O.
Mstafa, Ramadhan J.
KNOWLEDGE-BASED SYSTEMS, 2025, 315
[23] Semi-Supervised Heterogeneous Information Network Embedding for Node Classification using 1D-CNN
Sheikh, Nasrullah
Kefato, Zekarias T.
Montresor, Alberto
2018 FIFTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2018, : 177 - 181
[24] Radar-Based Multiple Target Classification in Complex Environments Using 1D-CNN Models
Yanik, Muhammet Emin
Rao, Sandeep
2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
[25] Malware Classification Using Ensemble Classifiers
Hijazi, Mohd Hanafi Ahmad
Beng, Tan Choon
Mountstephens, James
Lim, Yuto
Nisar, Kashif
ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1172 - 1176
[26] Predicting the Wear Amount of Tire Tread Using 1D-CNN
Park, Hyunjae
Seo, Junyeong
Kim, Kangjun
Kim, Taewung
SENSORS, 2024, 24 (21)
[27] An Improved Ensemble Approach for Imbalanced Classification Problems
Krawczyk, Bartosz
Schaefer, Gerald
2013 IEEE 8TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS (SACI 2013), 2013, : 423 - 426
[28] A Study on Wheel Member Condition Recognition Using 1D-CNN
Lee, Jin-Han
Lee, Jun-Hee
Lee, Chang-Jae
Lee, Seung-Lok
Kim, Jin-Pyung
Jeong, Jae-Hoon
SENSORS, 2023, 23 (23)
[29] Detecting Attacks on IoT Devices using Featureless 1D-CNN
Khan, Arshiya
Cotton, Chase
PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE (IEEE CSR), 2021, : 461 - 466
[30] A dual algorithmic approach to deal with multiclass imbalanced classification problems
Sridhar, S.
Anusuya, S.
BIG DATA RESEARCH, 2024, 38

← 1 2 3 4 5 →