Memory-efficient detection of large-scale obfuscated malware

被引:0
|
作者
Wang Y. [1 ]
Zhang M. [1 ]
机构
[1] College of Computer Science and Technology, Jilin University, Jilin, Changchun
关键词
algorithm; malware; Naïve Bayes;
D O I
10.1504/IJWMC.2024.136586
中图分类号
学科分类号
摘要
Obfuscation techniques are frequently used in malicious programs to evade detection. However, current effective methods often require much memory space during training. This paper proposes a machine-learning-based solution to the malware detection problem that consumes fewer memory resources. We use hash and sparse matrix to build a text bag of words to reduce memory usage during training. Experiments show that our approach reduces the memory footprint by 95% when using 110,000 text data for confusion recognition training compared to the existing model. In the de-obfuscation step, our method improves the recognition accuracy of the import table function by 40%. Our model achieves shallow memory usage during confusion recognition training and enhances the accuracy of imported table recognition. Additionally, the confusion recognition accuracy is only about 10% lower than the confusion recognition model before the improvement. Copyright © 2024 Inderscience Enterprises Ltd.
引用
收藏
页码:48 / 60
页数:12
相关论文
共 50 条
  • [1] Memory-Efficient Learning for Large-Scale Computational Imaging
    Kellman, Michael
    Zhang, Kevin
    Markley, Eric
    Tamir, Jon
    Bostan, Emrah
    Lustig, Michael
    Waller, Laura
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2020, 6 : 1403 - 1414
  • [2] Lazer: Distributed Memory-Efficient Assembly of Large-Scale Genomes
    Goswami, Sayan
    Das, Arghya Kusum
    Platania, Richard
    Lee, Kisung
    Park, Seung-Jong
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1171 - 1181
  • [3] Memory-efficient Large-scale Linear Support Vector Machine
    Alrajeh, Abdullah
    Takeda, Akiko
    Niranjan, Mahesan
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445
  • [4] Memory-Efficient Network for Large-scale Video Compressive Sensing
    Cheng, Ziheng
    Chen, Bo
    Liu, Guanliang
    Zhang, Hao
    Lu, Ruiying
    Wang, Zhengjue
    Yuan, Xin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16241 - 16250
  • [5] Memory-Efficient Pipelined Architecture for Large-Scale String Matching
    Yang, Yi-Hua E.
    Prasanna, Viktor K.
    PROCEEDINGS OF THE 2009 17TH IEEE SYMPOSIUM ON FIELD PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2009, : 104 - 111
  • [6] Scalable and Memory-Efficient Clustering of Large-Scale Social Networks
    Whang, Joyce Jiyoung
    Sui, Xin
    Dhillon, Inderjit S.
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 705 - 714
  • [7] Memory-Efficient Modeling and Slicing of Large-Scale Adaptive Lattice Structures
    Liu, Shengjun
    Liu, Tao
    Zou, Qiang
    Wang, Weiming
    Doubrovski, Eugeni L.
    Wang, Charlie C. L.
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2021, 21 (06)
  • [8] A Memory-Efficient and Modular Approach for Large-Scale String Pattern Matching
    Le, Hoang
    Prasanna, Viktor K.
    IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (05) : 844 - 857
  • [9] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA
    Li, Gang
    Li, Fanrong
    Zhao, Tianli
    Cheng, Jian
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 1163 - 1166
  • [10] Memory-efficient Krylov subspace techniques for solving large-scale Lyapunov equations
    Kressner, Daniel
    2008 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-AIDED CONTROL SYSTEM DESIGN, 2008, : 207 - 212