Memory-efficient detection of large-scale obfuscated malware

被引:0
|
作者
Wang Y. [1 ]
Zhang M. [1 ]
机构
[1] College of Computer Science and Technology, Jilin University, Jilin, Changchun
关键词
algorithm; malware; Naïve Bayes;
D O I
10.1504/IJWMC.2024.136586
中图分类号
学科分类号
摘要
Obfuscation techniques are frequently used in malicious programs to evade detection. However, current effective methods often require much memory space during training. This paper proposes a machine-learning-based solution to the malware detection problem that consumes fewer memory resources. We use hash and sparse matrix to build a text bag of words to reduce memory usage during training. Experiments show that our approach reduces the memory footprint by 95% when using 110,000 text data for confusion recognition training compared to the existing model. In the de-obfuscation step, our method improves the recognition accuracy of the import table function by 40%. Our model achieves shallow memory usage during confusion recognition training and enhances the accuracy of imported table recognition. Additionally, the confusion recognition accuracy is only about 10% lower than the confusion recognition model before the improvement. Copyright © 2024 Inderscience Enterprises Ltd.
引用
收藏
页码:48 / 60
页数:12
相关论文
共 50 条
  • [31] Enhanced detection of obfuscated malware in memory dumps: a machine learning approach for advanced cybersecurity
    Hossain, Md. Alamgir
    Islam, Md. Saiful
    CYBERSECURITY, 2024, 7 (01)
  • [32] Hierarchical Management of Large-Scale Malware Data
    Kellogg, Lee
    Ruttenberg, Brian
    O'Connor, Alison
    Howard, Michael
    Pfeffer, Avi
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 666 - 674
  • [33] PHash: A memory-efficient, high-performance key-value store for large-scale data-intensive applications
    Shim, Hyotaek
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 123 : 33 - 44
  • [34] Large-scale Malware Automatic Detection Based On Multiclass Features and Machine Learning
    Wang, Zhiqiang
    Tang, Yao
    Yao, Jing
    Qian, Rong
    Zhang, Zheng
    Ma, Pingchuan
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [35] An efficient algorithm for large-scale detection of protein families
    Enright, AJ
    Van Dongen, S
    Ouzounis, CA
    NUCLEIC ACIDS RESEARCH, 2002, 30 (07) : 1575 - 1584
  • [36] Towards Memory-Efficient Validation of Large XMI Models
    Jahanbin, Sorour
    Kolovos, Dimitris
    Gerasimou, Simos
    Proceedings - 2023 ACM/IEEE International Conference on Model Driven Engineering Languages and Systems Companion, MODELS-C 2023, 2023, : 241 - 250
  • [37] Memory efficient large-scale image-based localization
    Guoyu Lu
    Nicu Sebe
    Congfu Xu
    Chandra Kambhamettu
    Multimedia Tools and Applications, 2015, 74 : 479 - 503
  • [38] Obfuscated Memory Malware Detection in Resource-Constrained IoT Devices for Smart City Applications
    Shafin, Sakib Shahriar
    Karmakar, Gour
    Mareels, Iven
    SENSORS, 2023, 23 (11)
  • [39] Memory efficient large-scale image-based localization
    Lu, Guoyu
    Sebe, Nicu
    Xu, Congfu
    Kambhamettu, Chandra
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 479 - 503
  • [40] Towards Memory-Efficient Validation of Large XMI Models
    Jahanbin, Sorour
    Kolovos, Dimitris
    Gerasimou, Simos
    2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 241 - 250