An ensemble approach for imbalanced multiclass malware classification using 1D-CNN

被引:0
|
作者
Panda B. [1 ]
Bisoyi S.S. [2 ]
Panigrahy S. [3 ]
机构
[1] Department of Computer Science and Engineering, Institute of Technical Education and Research, Siksha ’O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar
[2] Department of Computer Science and Information Technology, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar
[3] Haas School of Business, University of California, Berkeley, Berkeley, CA
关键词
1D-CNN; API sequence; Dynamic analysis; Ensemble learning; Malware classification; Skip-gram;
D O I
10.7717/PEERJ-CS.1677
中图分类号
学科分类号
摘要
Dependence on the internet and computer programs demonstrates the significance of computer programs in our day-to-day lives. Such demands motivate malware developers to create more malware, both in terms of quantity and variety. Researchers are constantly faced with hurdles while attempting to protect themselves from potential hazards and risks due to malware authors’ usage of code obfuscation techniques. Metamorphic and polymorphic variations are easily able to elude the widely utilized signature-based detection procedures. Researchers are more interested in deep learning approaches than machine learning techniques to analyze the behavior of such a vast number of virus variants. Researchers have been drawn to the categorization of malware within itself in addition to the classification of malware against benign programs to examine the behavioral differences between them. In order to investigate the relationship between the application programming interface (API) calls throughout API sequences and classify them, this work uses the one-dimensional convolutional neural network (1D-CNN) model to solve a multiclass classification problem. On API sequences, feature vectors for distinctive APIs are created using the Word2Vec word embedding approach and the skip-gram model. The one-vs.-rest approach is used to train 1D-CNN models to categorize malware, and all of them are then combined with a suggested ModifiedSoftVoting algorithm to improve classification. On the open benchmark dataset Mal-API-2019, the suggested ensembled 1D-CNN architecture captures improved evaluation scores with an accuracy of 0.90, a weighted average F1-score of 0.90, and an AUC score of more than 0.96 for all classes of malware. Subjects Data Mining and Machine Learning, Security and Privacy, Neural Networks © 2023 Panda et al. Distributed under Creative Commons CC-BY 4.0. All Rights Reserved.
引用
收藏
相关论文
共 50 条
  • [31] The PCA and 1D-CNN Dimension Reduction Comparison for Hyperspectral Classification of Tree Species
    Badidova, Bianca
    Forgac, Radoslav
    Ockay, Milos
    2024 NEW TRENDS IN SIGNAL PROCESSING, NTSP 2024, 2024, : 6 - 10
  • [32] Detection of Corona Faults in Switchgear by Using 1D-CNN, LSTM, and 1D-CNN-LSTM Methods
    Alsumaidaee, Yaseen Ahmed Mohammed
    Yaw, Chong Tak
    Koh, Siaw Paw
    Tiong, Sieh Kiong
    Chen, Chai Phing
    Yusaf, Talal
    Abdalla, Ahmed N.
    Ali, Kharudin
    Raj, Avinash Ashwin
    SENSORS, 2023, 23 (06)
  • [33] 1D-CNN FOR LAND COVER CLASSIFICATION OF SENTINEL-3 ALTIMETRY WAVEFORMS USING ADDITIONAL FEATURES
    Eitel, Maximilian
    Schmitt, Michael
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 3058 - 3061
  • [34] CADNet: cardiac arrhythmia detection and classification using unified principal component analysis and 1D-CNN model
    Borra S.R.
    Nayana D.R.G.A.
    Srinidhi S.
    Bhavana S.
    Nishitha P.
    Sahithi V.
    Research on Biomedical Engineering, 2024, 40 (2) : 317 - 329
  • [35] Research on A Classification Algorithm of Near-Infrared Spectroscopy Based on 1D-CNN
    Pu Shan-shan
    Zheng En-rang
    Chen Bei
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43 (08) : 2446 - 2451
  • [36] Stepped frequency radar target recognition using 1D-CNN
    Jouny, I
    AUTOMATIC TARGET RECOGNITION XXXII, 2022, 12096
  • [37] An efficient approach for diagnosing faults in photovoltaic array using 1D-CNN and feature selection Techniques
    Ali, Yousif Mahmoud
    Ding, Lei
    Qin, Shiyao
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2025, 166
  • [38] Prognostics of Aluminum Electrolytic Capacitors Based on Chained-SVR and 1D-CNN Ensemble Learning
    Wang, Fanyu
    Cai, Yuanfeng
    Tang, Hao
    Lin, Zequn
    Pei, Yiru
    Wu, Yichun
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (11) : 13995 - 14012
  • [39] Ensemble Machine Learning Approach for Android Malware Classification Using Hybrid Features
    Pektas, Abdurrahman
    Acarman, Tankut
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2017, 2018, 578 : 191 - 200
  • [40] An Ensemble approach for advance malware memory analysis using Image classification techniques
    Vashishtha, Lalit Kumar
    Chatterjee, Kakali
    Rout, Siddhartha Suman
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2023, 77