WS-LSMR: Malicious WebShell Detection Algorithm Based on Ensemble Learning

被引:10
|
作者
Ai, Zhuang [1 ]
Luktarhan, Nurbol [1 ]
Zhao, Yuxin [2 ]
Tang, Chaofei [2 ]
机构
[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Peoples R China
[2] Xinjiang Univ, Coll Software, Urumqi 830046, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Machine learning algorithms; Forestry; Trojan horses; Training; Adaptation models; Prediction algorithms; Ensemble learning; information entropy; WebShell;
D O I
10.1109/ACCESS.2020.2989304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To solve the problem that the features produced by hidden means, such as code obfuscation and compression, in encrypted malicious WebShell files are not the same as those produced by non-encrypted files, a WebShell attack detection algorithm based on ensemble learning is proposed. First, this algorithm extracted the feature vocabulary of the unigrams and 4-grams based on opcode; subsequently, the 4-gram feature word weights were obtained according to the calculated Gini coefficient of the unigram feature words and used to select the features, which will be selected again based on the Gini coefficient of the 4-gram feature words. Consequently, a feature vocabulary that can detect encrypted and unencrypted WebShell files was constructed. Second, in order to improve the adaptability and accuracy of the detection method, an ensemble detection model called WS-LSMR, consisting of a Logistic Regression, Support Vector Machine, Multi-layer Perceptron and Random Forest, was constructed. The model uses a weighted voting method to determine the WebShell classification. This experiment demonstrated that compared with the traditional single WebShell detection algorithm, the recall rate and accuracy rate improved to 99.14% and 94.28%, respectively, which proves that this method has better detection performance.
引用
收藏
页码:75785 / 75797
页数:13
相关论文
共 50 条
  • [31] The Algorithm of Malicious Code Detection Based on Data Mining
    Yang, Yubo
    Zhao, Yang
    Liu, Xiabi
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [32] Malicious URLs detection based on a novel optimization algorithm
    Bo W.
    Fang Z.B.
    Wei L.X.
    Cheng Z.F.
    Hua Z.X.
    IEICE Transactions on Information and Systems, 2021, E104.D (04): : 513 - 516
  • [33] Malicious webpages analysis and detection algorithm based on BiLSTM
    Wang H.-H.
    Yu L.
    Tian S.-W.
    Luo S.-Q.
    Pei X.-J.
    International Journal of Electronic Business, 2020, 15 (04) : 351 - 367
  • [34] A Malicious Webpage Detection Algorithm Based on Image Semantics
    Li, Xiangjun
    Li, Sifan
    Liu, Shengnan
    Liu, Lingfeng
    He, Daojing
    TRAITEMENT DU SIGNAL, 2020, 37 (01) : 113 - 118
  • [35] Malicious URLs Detection Based on a Novel Optimization Algorithm
    Wang, Bo
    Zhang, B. Fang
    Liu, X. Wei
    Zou, F. Cheng
    Zhang, X. Hua
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (04): : 513 - 516
  • [36] Unknown malicious detection based on improved Bayes algorithm
    Lai, Ying-Xu
    Yang, Zhen
    Beijing Gongye Daxue Xuebao/Journal of Beijing University of Technology, 2011, 37 (05): : 766 - 772
  • [37] Deep Learning Based Webshell Detection Coping with Long Text and Lexical Ambiguity
    An, Tongjian
    Shui, Xuefei
    Gao, Hongkui
    INFORMATION AND COMMUNICATIONS SECURITY, ICICS 2022, 2022, 13407 : 438 - 457
  • [38] Webshell Traffic Detection With Character-Level Features Based on Deep Learning
    Zhang, Hua
    Guan, Hongchao
    Yan, Hanbing
    Li, Wenmin
    Yu, Yuqi
    Zhou, Hao
    Zeng, Xingyu
    IEEE ACCESS, 2018, 6 : 75268 - 75277
  • [39] MSDetector: A Static PHP Webshell Detection System Based on Deep-Learning
    Cheng, Baijun
    Guo, Yanhui
    Ren, Yan
    Yang, Gang
    Xu, Guosheng
    THEORETICAL ASPECTS OF SOFTWARE ENGINEERING, TASE 2022, 2022, 13299 : 155 - 172
  • [40] EMDG-FL: Enhanced Malicious Model Detection based on Genetic Algorithm for Federated Learning
    Ben Atia, Okba
    Al Samara, Mustafa
    Bennis, Ismail
    Gaber, Jaafar
    Abouaissa, Abdelhafid
    Lorenz, Pascal
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,