An ensemble approach for imbalanced multiclass malware classification using 1D-CNN

被引:0
|
作者
Panda B. [1 ]
Bisoyi S.S. [2 ]
Panigrahy S. [3 ]
机构
[1] Department of Computer Science and Engineering, Institute of Technical Education and Research, Siksha ’O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar
[2] Department of Computer Science and Information Technology, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar
[3] Haas School of Business, University of California, Berkeley, Berkeley, CA
关键词
1D-CNN; API sequence; Dynamic analysis; Ensemble learning; Malware classification; Skip-gram;
D O I
10.7717/PEERJ-CS.1677
中图分类号
学科分类号
摘要
Dependence on the internet and computer programs demonstrates the significance of computer programs in our day-to-day lives. Such demands motivate malware developers to create more malware, both in terms of quantity and variety. Researchers are constantly faced with hurdles while attempting to protect themselves from potential hazards and risks due to malware authors’ usage of code obfuscation techniques. Metamorphic and polymorphic variations are easily able to elude the widely utilized signature-based detection procedures. Researchers are more interested in deep learning approaches than machine learning techniques to analyze the behavior of such a vast number of virus variants. Researchers have been drawn to the categorization of malware within itself in addition to the classification of malware against benign programs to examine the behavioral differences between them. In order to investigate the relationship between the application programming interface (API) calls throughout API sequences and classify them, this work uses the one-dimensional convolutional neural network (1D-CNN) model to solve a multiclass classification problem. On API sequences, feature vectors for distinctive APIs are created using the Word2Vec word embedding approach and the skip-gram model. The one-vs.-rest approach is used to train 1D-CNN models to categorize malware, and all of them are then combined with a suggested ModifiedSoftVoting algorithm to improve classification. On the open benchmark dataset Mal-API-2019, the suggested ensembled 1D-CNN architecture captures improved evaluation scores with an accuracy of 0.90, a weighted average F1-score of 0.90, and an AUC score of more than 0.96 for all classes of malware. Subjects Data Mining and Machine Learning, Security and Privacy, Neural Networks © 2023 Panda et al. Distributed under Creative Commons CC-BY 4.0. All Rights Reserved.
引用
收藏
相关论文
共 50 条
  • [21] Snoring Sound Classification Using 1D-CNN Model Based on Multi-Feature Extraction
    Adesuyi, TosinAkinwale
    Kim, Byeong-Man
    Kim, Jongwan
    INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2022, 22 (01) : 1 - 10
  • [22] Advanced deep learning framework for ECG arrhythmia classification using 1D-CNN with attention mechanism
    Guhdar, Mohammed
    Mohammed, Abdulhakeem O.
    Mstafa, Ramadhan J.
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [23] Semi-Supervised Heterogeneous Information Network Embedding for Node Classification using 1D-CNN
    Sheikh, Nasrullah
    Kefato, Zekarias T.
    Montresor, Alberto
    2018 FIFTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2018, : 177 - 181
  • [24] Radar-Based Multiple Target Classification in Complex Environments Using 1D-CNN Models
    Yanik, Muhammet Emin
    Rao, Sandeep
    2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [25] Malware Classification Using Ensemble Classifiers
    Hijazi, Mohd Hanafi Ahmad
    Beng, Tan Choon
    Mountstephens, James
    Lim, Yuto
    Nisar, Kashif
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1172 - 1176
  • [26] Predicting the Wear Amount of Tire Tread Using 1D-CNN
    Park, Hyunjae
    Seo, Junyeong
    Kim, Kangjun
    Kim, Taewung
    SENSORS, 2024, 24 (21)
  • [27] An Improved Ensemble Approach for Imbalanced Classification Problems
    Krawczyk, Bartosz
    Schaefer, Gerald
    2013 IEEE 8TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS (SACI 2013), 2013, : 423 - 426
  • [28] A Study on Wheel Member Condition Recognition Using 1D-CNN
    Lee, Jin-Han
    Lee, Jun-Hee
    Lee, Chang-Jae
    Lee, Seung-Lok
    Kim, Jin-Pyung
    Jeong, Jae-Hoon
    SENSORS, 2023, 23 (23)
  • [29] Detecting Attacks on IoT Devices using Featureless 1D-CNN
    Khan, Arshiya
    Cotton, Chase
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE (IEEE CSR), 2021, : 461 - 466
  • [30] A dual algorithmic approach to deal with multiclass imbalanced classification problems
    Sridhar, S.
    Anusuya, S.
    BIG DATA RESEARCH, 2024, 38