Deobfuscation, unpacking, and decoding of obfuscated malicious Java']JavaScript for machine learning models detection performance improvement

被引:21
|
作者
Ndichu, Samuel [1 ]
Kim, Sangwook [1 ]
Ozawa, Seiichi [1 ,2 ]
机构
[1] Kobe Univ, Grad Sch Engn, Kobe, Hyogo, Japan
[2] Kobe Univ, Ctr Math & Data Sci, Kobe, Hyogo, Japan
关键词
D O I
10.1049/trit.2020.0026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Obfuscation is rampant in both benign and malicious JavaScript (JS) codes. It generates an obscure and undetectable code that hinders comprehension and analysis. Therefore, accurate detection of JS codes that masquerade as innocuous scripts is vital. The existing deobfuscation methods assume that a specific tool can recover an original JS code entirely. For a multi-layer obfuscation, general tools realize a formatted JS code, but some sections remain encoded. For the detection of such codes, this study performs Deobfuscation, Unpacking, and Decoding (DUD-preprocessing) by function redefinition using a Virtual Machine (VM), a JS code editor, and a python int_to_str() function to facilitate feature learning by the FastText model. The learned feature vectors are passed to a classifier model that judges the maliciousness of a JS code. In performance evaluation, the authors use the Hynek Petrak's dataset for obfuscated malicious JS codes and the SRILAB dataset and the Majestic Million service top 10,000 websites for obfuscated benign JS codes. They then compare the performance to other models on the detection of DUD-preprocessed obfuscated malicious JS codes. Their experimental results show that the proposed approach enhances feature learning and provides improved accuracy in the detection of obfuscated malicious JS codes.
引用
收藏
页码:184 / 192
页数:9
相关论文
共 40 条
  • [1] Obfuscated Malicious Java']JavaScript Detection by Machine Learning
    Pan, Jinkun
    Mao, Xiaoguang
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 805 - 810
  • [2] Detection of Obfuscated Malicious Java']JavaScript Code
    Alazab, Ammar
    Khraisat, Ansam
    Alazab, Moutaz
    Singh, Sarabjot
    [J]. FUTURE INTERNET, 2022, 14 (08):
  • [3] TransAST: A Machine Translation-Based Approach for Obfuscated Malicious Java']JavaScript Detection
    Qin, Yan
    Wang, Weiping
    Chen, Zixian
    Song, Hong
    Zhang, Shigeng
    [J]. 2023 53RD ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, DSN, 2023, : 327 - 338
  • [4] Obfuscated Malicious Java']Javascript Detection using Classification Techniques
    Likarish, Peter
    Jung, Eunjin E. J.
    Jo, Insoon
    [J]. 2009 4TH INTERNATIONAL CONFERENCE ON MALICIOUS AND UNWANTED SOFTWARE (MALWARE 2009), 2009, : 47 - +
  • [5] JAST: Fully Syntactic Detection of Malicious (Obfuscated) Java']JavaScript
    Fass, Aurore
    Krawczyk, Robert P.
    Backes, Michael
    Stock, Ben
    [J]. DETECTION OF INTRUSIONS AND MALWARE, AND VULNERABILITY ASSESSMENT, DIMVA 2018, 2018, 10885 : 303 - 325
  • [6] A Half-Dynamic Classification Method on Obfuscated Malicious Java']JavaScript Detection
    Fang, Zhaolin
    Zhu, Renhuan
    Zhang, Weihui
    Chen, Bo
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2015, 9 (06): : 251 - 262
  • [7] Obfuscated Malicious Java']JavaScript Detection Scheme Using the Feature Based on Divided URL
    Morishige, Shoya
    Haruta, Shuichiro
    Asahina, Hiromu
    Sasase, Iwao
    [J]. 2017 23RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC): BRIDGING THE METROPOLITAN AND THE REMOTE, 2017, : 518 - 523
  • [8] A Machine Learning Approach to Malicious Java']JavaScript Detection using Fixed Length Vector Representation
    Ndichu, Samuel
    Ozawa, Seiichi
    Misu, Takeshi
    Okada, Kouichirou
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [9] An Empirical Study on the Effects of Obfuscation on Static Machine Learning-Based Malicious Java']JavaScript Detectors
    Ren, Kunlun
    Qiang, Weizhong
    Wu, Yueming
    Zhou, Yi
    Zou, Deqing
    Jin, Hai
    [J]. PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 1420 - 1432
  • [10] Accuracy Improvement Method for Malicious Domain Detection using Machine Learning
    Koga, Toshiki
    Nobayashi, Daiki
    Ikenaga, Takeshi
    [J]. 2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1108 - 1109