WS-LSMR: Malicious WebShell Detection Algorithm Based on Ensemble Learning

被引：10

作者：

Ai, Zhuang ^{[1
]}

Luktarhan, Nurbol ^{[1
]}

Zhao, Yuxin ^{[2
]}

Tang, Chaofei ^{[2
]}

机构：

[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Peoples R China

[2] Xinjiang Univ, Coll Software, Urumqi 830046, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Machine learning algorithms; Forestry; Trojan horses; Training; Adaptation models; Prediction algorithms; Ensemble learning; information entropy; WebShell;

D O I：

10.1109/ACCESS.2020.2989304

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To solve the problem that the features produced by hidden means, such as code obfuscation and compression, in encrypted malicious WebShell files are not the same as those produced by non-encrypted files, a WebShell attack detection algorithm based on ensemble learning is proposed. First, this algorithm extracted the feature vocabulary of the unigrams and 4-grams based on opcode; subsequently, the 4-gram feature word weights were obtained according to the calculated Gini coefficient of the unigram feature words and used to select the features, which will be selected again based on the Gini coefficient of the 4-gram feature words. Consequently, a feature vocabulary that can detect encrypted and unencrypted WebShell files was constructed. Second, in order to improve the adaptability and accuracy of the detection method, an ensemble detection model called WS-LSMR, consisting of a Logistic Regression, Support Vector Machine, Multi-layer Perceptron and Random Forest, was constructed. The model uses a weighted voting method to determine the WebShell classification. This experiment demonstrated that compared with the traditional single WebShell detection algorithm, the recall rate and accuracy rate improved to 99.14% and 94.28%, respectively, which proves that this method has better detection performance.

引用

页码：75785 / 75797

页数：13

共 50 条

[1] Malicious codes detection based on ensemble learning
Zhang, Boyun
Yin, Jianping
Hao, Jingbo
Zhang, Dingxing
Wang, Shulin
AUTONOMIC AND TRUSTED COMPUTING, PROCEEDINGS, 2007, 4610 : 468 - +
[2] A Malicious Code Detection Method Based on Ensemble Learning of Behavior
Xu X.-B.
Zhang W.-B.
He C.
Luo Y.
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (04): : 89 - 95
[3] MalFinder: An Ensemble Learning-based Framework For Malicious Traffic Detection
Rong, Candong
Gou, Gaopeng
Cui, Mingxin
Xiong, Gang
Li, Zhen
Guo, Li
2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, : 588 - 594
[4] Ensemble machine learning approaches for webshell detection in Internet of things environments
Yong, Binbin
Wei, Wei
Li, Kuan-Ching
Shen, Jun
Zhou, Qingguo
Wozniak, Marcin
Polap, Dawid
Damasevicius, Robertas
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (06)
[5] Explainable Ensemble Learning Based Detection of Evasive Malicious PDF Documents
Yerima, Suleiman Y.
Bashar, Abul
ELECTRONICS, 2023, 12 (14)
[6] Webshell detection based on multi-classifier ensemble model
Wenjuan-Lian
Qi-Fan
Dandan-Shi
Qili-An
Jia, Bin
Journal of Computers (Taiwan), 2020, 31 (01): : 242 - 252
[7] Malicious mining code detection based on ensemble learning in cloud computing environment
Li, Shudong
Li, Yuan
Han, Weihong
Du, Xiaojiang
Guizani, Mohsen
Tian, Zhihong
SIMULATION MODELLING PRACTICE AND THEORY, 2021, 113
[8] An Improved Ensemble Deep Learning Model Based on CNN for Malicious Website Detection
Do, Nguyet Quang
Selamat, Ali
Lim, Kok Cheng
Krejcar, Ondrej
ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 497 - 504
[9] WebShell Detection Based on CodeBERT and Deep Learning Model
Wang, Guan-Yu
Ko, Hung-Jui
Chiang, Chang-Po
Wang, Wei-Jen
2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 484 - 489
[10] A Malicious Code Static Detection Framework Based on Multi-Feature Ensemble Learning
Yang W.
Gao M.
Jiang T.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (05): : 1021 - 1034

← 1 2 3 4 5 →