An ensemble approach for imbalanced multiclass malware classification using 1D-CNN

被引:0
|
作者
Panda B. [1 ]
Bisoyi S.S. [2 ]
Panigrahy S. [3 ]
机构
[1] Department of Computer Science and Engineering, Institute of Technical Education and Research, Siksha ’O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar
[2] Department of Computer Science and Information Technology, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan (Deemed to be) University, Odisha, Bhubaneswar
[3] Haas School of Business, University of California, Berkeley, Berkeley, CA
关键词
1D-CNN; API sequence; Dynamic analysis; Ensemble learning; Malware classification; Skip-gram;
D O I
10.7717/PEERJ-CS.1677
中图分类号
学科分类号
摘要
Dependence on the internet and computer programs demonstrates the significance of computer programs in our day-to-day lives. Such demands motivate malware developers to create more malware, both in terms of quantity and variety. Researchers are constantly faced with hurdles while attempting to protect themselves from potential hazards and risks due to malware authors’ usage of code obfuscation techniques. Metamorphic and polymorphic variations are easily able to elude the widely utilized signature-based detection procedures. Researchers are more interested in deep learning approaches than machine learning techniques to analyze the behavior of such a vast number of virus variants. Researchers have been drawn to the categorization of malware within itself in addition to the classification of malware against benign programs to examine the behavioral differences between them. In order to investigate the relationship between the application programming interface (API) calls throughout API sequences and classify them, this work uses the one-dimensional convolutional neural network (1D-CNN) model to solve a multiclass classification problem. On API sequences, feature vectors for distinctive APIs are created using the Word2Vec word embedding approach and the skip-gram model. The one-vs.-rest approach is used to train 1D-CNN models to categorize malware, and all of them are then combined with a suggested ModifiedSoftVoting algorithm to improve classification. On the open benchmark dataset Mal-API-2019, the suggested ensembled 1D-CNN architecture captures improved evaluation scores with an accuracy of 0.90, a weighted average F1-score of 0.90, and an AUC score of more than 0.96 for all classes of malware. Subjects Data Mining and Machine Learning, Security and Privacy, Neural Networks © 2023 Panda et al. Distributed under Creative Commons CC-BY 4.0. All Rights Reserved.
引用
收藏
相关论文
共 50 条
  • [41] 1D-CNN: Speech Emotion Recognition System Using a Stacked Network with Dilated CNN Features
    Mustaqeem
    Kwon, Soonil
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (03): : 4039 - 4059
  • [42] Improving the accuracy of Anomaly Detection in Multimodal Sensors using 1D-CNN
    Imad, Muhammad
    Cleland, Ian
    McAllister, Patrick
    Nugent, Chris
    17TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2024, 2024, : 212 - 221
  • [43] Prediction of surface water pollution using wavelet transform and 1D-CNN
    Wang, Gaofeng
    Zhang, Hao
    Gao, Man
    Zhou, Tao
    Qian, Yun
    WATER SCIENCE AND TECHNOLOGY, 2025, 91 (06) : 684 - 697
  • [44] Prognostics of Aluminum Electrolytic Capacitors Based on Chained-SVR and 1D-CNN Ensemble Learning
    Fanyu Wang
    Yuanfeng Cai
    Hao Tang
    Zequn Lin
    Yiru Pei
    Yichun Wu
    Arabian Journal for Science and Engineering, 2022, 47 : 13995 - 14012
  • [45] Wireless Local Area Networks Threat Detection Using 1D-CNN
    Natkaniec, Marek
    Bednarz, Marcin
    SENSORS, 2023, 23 (12)
  • [46] CA mortar void identification for ballastless track using 1D-CNN
    Chen X.
    Li X.
    Xu L.
    Deng Y.
    Huang H.
    Deng Y.
    Journal of Railway Science and Engineering, 2024, 21 (04) : 1645 - 1655
  • [47] FPGA Implementation of a BPSK 1D-CNN Demodulator
    Liu, Yan
    Shen, Yue
    Li, Li
    Wang, Hai
    APPLIED SCIENCES-BASEL, 2018, 8 (03):
  • [48] Variety classification and identification of jujube based on near-infrared spectroscopy and 1D-CNN
    Li, Xu
    Wu, Jingming
    Bai, Tiecheng
    Wu, Cuiyun
    He, Yufeng
    Huang, Jianxi
    Li, Xuecao
    Shi, Ziyan
    Hou, Kaiyao
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 223
  • [49] FPGA-based 1D-CNN accelerator for real-time arrhythmia classification
    Zheming Liu
    Xiaofeng Ling
    Yu Zhu
    Nan Wang
    Journal of Real-Time Image Processing, 2025, 22 (2)
  • [50] Low-Resolution Ground Surveillance Radar Target Classification Based on 1D-CNN
    Xie, Renhong
    Sun, Zeyu
    Wang, Huan
    Li, Peng
    Rui, Yibin
    Wang, Liyan
    Bian, ChenGuang
    ELEVENTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2019, 11384