Deobfuscation, unpacking, and decoding of obfuscated malicious JavaScript for machine learning models detection performance improvement

Cited by: 21

Authors:

Ndichu, Samuel ^{[1]}

Kim, Sangwook ^{[1]}

Ozawa, Seiichi ^{[1,2]}

Affiliations:

[1] Kobe Univ, Grad Sch Engn, Kobe, Hyogo, Japan

[2] Kobe Univ, Ctr Math & Data Sci, Kobe, Hyogo, Japan

Source:

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2020 / Vol. 5 / Issue 03

DOI:

10.1049/trit.2020.0026

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Obfuscation is rampant in both benign and malicious JavaScript (JS) codes. It generates an obscure and undetectable code that hinders comprehension and analysis. Therefore, accurate detection of JS codes that masquerade as innocuous scripts is vital. The existing deobfuscation methods assume that a specific tool can recover an original JS code entirely. For a multi-layer obfuscation, general tools realize a formatted JS code, but some sections remain encoded. For the detection of such codes, this study performs Deobfuscation, Unpacking, and Decoding (DUD-preprocessing) by function redefinition using a Virtual Machine (VM), a JS code editor, and a Python int_to_str() function to facilitate feature learning by the FastText model. The learned feature vectors are passed to a classifier model that judges the maliciousness of a JS code. In performance evaluation, the authors use Hynek Petrak's dataset for obfuscated malicious JS codes, and the SRILAB dataset and the top 10,000 websites from the Majestic Million service for obfuscated benign JS codes. They then compare the performance to other models on the detection of DUD-preprocessed obfuscated malicious JS codes. Their experimental results show that the proposed approach enhances feature learning and provides improved accuracy in the detection of obfuscated malicious JS codes.
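
The decoding step mentioned in the abstract can be illustrated with a minimal sketch. The abstract names a Python int_to_str() function but does not define it, so the version below is an assumption: it treats the encoded sections as decimal character codes passed to String.fromCharCode(...) and rewrites them as plain string literals. The helper decode_char_codes, the regular expression, and the sample input are hypothetical and are not taken from the paper.

```python
import re


def int_to_str(codes):
    """Hypothetical stand-in for the paper's int_to_str(): join character codes into text."""
    return "".join(chr(code) for code in codes)


# Matches String.fromCharCode(97, 108, ...) with decimal arguments only (an assumption;
# real samples may use hex literals, expressions, or other encodings).
_FROM_CHAR_CODE = re.compile(r"String\.fromCharCode\(\s*(\d+(?:\s*,\s*\d+)*)\s*\)")


def decode_char_codes(js_source):
    """Replace decimal String.fromCharCode(...) calls with quoted string literals."""

    def _replace(match):
        codes = [int(token) for token in match.group(1).split(",")]
        decoded = int_to_str(codes)
        # Escape backslashes and double quotes; control characters are left as-is.
        escaped = decoded.replace("\\", "\\\\").replace('"', '\\"')
        return '"' + escaped + '"'

    return _FROM_CHAR_CODE.sub(_replace, js_source)


sample = "eval(String.fromCharCode(97, 108, 101, 114, 116, 40, 49, 41));"
print(decode_char_codes(sample))  # eval("alert(1)");
```

The point of such a step is that the decoded text exposes tokens like alert to a downstream feature learner, whereas the encoded form only exposes a run of integers. Multi-layer samples would likely need the function applied repeatedly, together with the other preprocessing steps the abstract describes.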


Pages: 184-192

Page count: 9

40 items in total

[1] Obfuscated Malicious JavaScript Detection by Machine Learning
Pan, Jinkun
Mao, Xiaoguang
[J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 805 - 810
[2] Detection of Obfuscated Malicious JavaScript Code
Alazab, Ammar
Khraisat, Ansam
Alazab, Moutaz
Singh, Sarabjot
[J]. FUTURE INTERNET, 2022, 14 (08)
[3] TransAST: A Machine Translation-Based Approach for Obfuscated Malicious JavaScript Detection
Qin, Yan
Wang, Weiping
Chen, Zixian
Song, Hong
Zhang, Shigeng
[J]. 2023 53RD ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, DSN, 2023 : 327 - 338
[4] Obfuscated Malicious Javascript Detection using Classification Techniques
Likarish, Peter
Jung, Eunjin E. J.
Jo, Insoon
[J]. 2009 4TH INTERNATIONAL CONFERENCE ON MALICIOUS AND UNWANTED SOFTWARE (MALWARE 2009), 2009 : 47 - +
[5] JAST: Fully Syntactic Detection of Malicious (Obfuscated) JavaScript
Fass, Aurore
Krawczyk, Robert P.
Backes, Michael
Stock, Ben
[J]. DETECTION OF INTRUSIONS AND MALWARE, AND VULNERABILITY ASSESSMENT, DIMVA 2018, 2018, 10885 : 303 - 325
[6] A Half-Dynamic Classification Method on Obfuscated Malicious JavaScript Detection
Fang, Zhaolin
Zhu, Renhuan
Zhang, Weihui
Chen, Bo
[J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2015, 9 (06) : 251 - 262
[7] Obfuscated Malicious JavaScript Detection Scheme Using the Feature Based on Divided URL
Morishige, Shoya
Haruta, Shuichiro
Asahina, Hiromu
Sasase, Iwao
[J]. 2017 23RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC): BRIDGING THE METROPOLITAN AND THE REMOTE, 2017 : 518 - 523
[8] A Machine Learning Approach to Malicious JavaScript Detection using Fixed Length Vector Representation
Ndichu, Samuel
Ozawa, Seiichi
Misu, Takeshi
Okada, Kouichirou
[J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018
[9] An Empirical Study on the Effects of Obfuscation on Static Machine Learning-Based Malicious JavaScript Detectors
Ren, Kunlun
Qiang, Weizhong
Wu, Yueming
Zhou, Yi
Zou, Deqing
Jin, Hai
[J]. PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023 : 1420 - 1432
[10] Accuracy Improvement Method for Malicious Domain Detection using Machine Learning
Koga, Toshiki
Nobayashi, Daiki
Ikenaga, Takeshi
[J]. 2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024 : 1108 - 1109
