Detecting new obfuscated malware variants: A lightweight and interpretable machine learning approach

被引:0
|
作者
Madamidola, Oladipo A. [1 ]
Ngobigha, Felix [1 ]
Ez-zizi, Adnane [1 ]
机构
[1] Univ Suffolk, Waterfront Bldg, Ipswich IP4 1QJ, England
来源
关键词
Cyber security; Obfuscated malware; Detection of unknown malware; Machine learning; Explainable machine learning;
D O I
10.1016/j.iswa.2024.200472
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning has been successfully applied in developing malware detection systems, with a primary focus on accuracy, and increasing attention to reducing computational overhead and improving model interpretability. However, an important question remains underexplored: How well can machine learning-based models detect entirely new forms of malware not present in the training data? In this study, we present a machine learningbased system for detecting obfuscated malware that is not only highly accurate, lightweight and interpretable, but also capable of successfully adapting to new types of malware attacks. Our system is capable of detecting 15 malware subtypes despite being exclusively trained on one malware subtype, namely the Transponder from the Spyware family. This system was built after training 15 distinct random forest-based models, each on a different malware subtype from the CIC-MalMem-2022 dataset. These models were evaluated against the entire range of malware subtypes, including all unseen malware subtypes. To maintain the system's streamlined nature, training was confined to the top five most important features, which also enhanced interpretability. The Transponderfocused model exhibited high accuracy, exceeding 99.8%, with an average processing speed of 5.7 mu s per file. We also illustrate how the Shapley additive explanations technique can facilitate the interpretation of the model predictions. Our research contributes to advancing malware detection methodologies, pioneering the feasibility of detecting obfuscated malware by exclusively training a model on a single or a few carefully selected malware subtype and applying it to detect unseen subtypes.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] An Interpretable Deep Learning Approach for Detecting Marine Heatwaves Patterns
    He, Qi
    Zhu, Zihang
    Zhao, Danfeng
    Song, Wei
    Huang, Dongmei
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [32] Cardiovascular Risk Assessment: An Interpretable Machine Learning Approach
    Paredes, S.
    Rocha, T.
    de Carvalho, P.
    Roseiro, I.
    Henriques, J.
    Sousa, J.
    INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS 2022, ICBHI 2022, 2024, 108 : 95 - 103
  • [33] An Interpretable Machine Learning Approach for Hepatitis B Diagnosis
    Obaido, George
    Ogbuokiri, Blessing
    Swart, Theo G.
    Ayawei, Nimibofa
    Kasongo, Sydney Mambwe
    Aruleba, Kehinde
    Mienye, Ibomoiye Domor
    Aruleba, Idowu
    Chukwu, Williams
    Osaye, Fadekemi
    Egbelowo, Oluwaseun F.
    Simphiwe, Simelane
    Esenogho, Ebenezer
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [34] On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach
    Wei, Dennis
    Nair, Rahul
    Dhurandhar, Amit
    Varshney, Kush R.
    Daly, Elizabeth M.
    Singh, Moninder
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [35] An interpretable and versatile machine learning approach for oocyte phenotyping
    Letort, Gaelle
    Eichmuller, Adrien
    Da Silva, Christelle
    Nikalayevich, Elvira
    Crozet, Flora
    Salle, Jeremy
    Minc, Nicolas
    Labrune, Elsa
    Wolf, Jean-Philippe
    Terret, Marie-Emilie
    Verlhac, Marie-Helene
    JOURNAL OF CELL SCIENCE, 2022, 135 (13)
  • [36] An Interpretable Machine Learning Approach for Laser Lifetime Prediction
    Abdelli, Khouloud
    Griesser, Helmut
    Pachnicke, Stephan
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2024, 42 (06) : 2094 - 2102
  • [37] Machine Learning for SAST: A Lightweight and Adaptable Approach
    Huether, Lorenz
    Sohr, Karsten
    Berger, Bernhard J.
    Rothe, Hendrik
    Edelkamp, Stefan
    COMPUTER SECURITY - ESORICS 2023, PT IV, 2024, 14347 : 85 - 104
  • [38] A Lightweight and Multi-Stage Approach for Android Malware Detection Using Non-Invasive Machine Learning Techniques
    da Costa, Leonardo
    Moia, Vitor
    IEEE ACCESS, 2023, 11 : 73127 - 73144
  • [39] Detecting Social Media Rumor Debunking Effectiveness During Public Health Emergencies: An Interpretable Machine Learning Approach
    Zhang, Shuai
    Hou, Jianhua
    Zhang, Yang
    Yao, Zhizhen
    Zhang, Zhijian
    SCIENCE COMMUNICATION, 2025, 47 (01) : 23 - 56
  • [40] A Novel Malware Analysis Framework for Malware Detection and Classification using Machine Learning Approach
    Sethi, Kamalakanta
    Chaudhary, Shankar Kumar
    Tripathy, Bata Krishan
    Bera, Padmalochan
    ICDCN'18: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, 2018,