Feature optimization and hybrid classification for malicious web page detection

被引:2
|
作者
Deng, Weiping [1 ]
Peng, Yan [2 ]
Yang, Fan [2 ]
Song, Jun [2 ]
机构
[1] Hubei Univ Econ, Sch Informat & Commun Engn, Wuhan 430205, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
classifier fusion; feature selection; information gain; machine learning; malicious web page; URLS;
D O I
10.1002/cpe.5859
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The security threats from malicious web pages have become a hot topic for cyber security. One goal pursued by current research is to identify malicious web pages quickly, accurately, and efficiently. Considering the high detection costs and potential dimensionality curse of malicious webpage detection, in this article, we proposes a detection framework based on feature optimization and hybrid classification. It provides three properties: more new malicious webpage features, information gain-based feature selection method, and integrating multiple machine learning method. A comprehensive experimental evaluation demonstrates that the proposed framework has remarkable advantages in aspects of detection accuracy and detection performance.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] A feature selection for malicious detection
    Lai Ying-xu
    PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 365 - 370
  • [32] Web page downloading and classification
    Tran, LQ
    Moon, CW
    Le, DX
    Thoma, GR
    FOURTEENTH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2001, : 321 - 326
  • [33] Web Page Genre Classification
    Chen, Guangyu
    Choi, Ben
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 2353 - 2357
  • [34] Using Machine Learning for Web Page Classification in Search Engine Optimization
    Matosevic, Goran
    Dobsa, Jasminka
    Mladenic, Dunja
    FUTURE INTERNET, 2021, 13 (01): : 1 - 20
  • [35] Automatic Web Page Classification
    Materna, Jiri
    RASLAN 2008: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING: SECOND WORKSHOP, 2008, : 84 - 93
  • [36] Web page genre classification
    Computer Science, Louisiana Tech University, LA 71272, United States
    Proc ACM Symp Appl Computing, (2353-2357):
  • [37] On Chinese web page classification
    Liang, JZ
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 634 - 639
  • [38] Characterization and classification of malicious Web traffic
    Goseva-Popstojanova, Katerina
    Anastasovski, Goce
    Dimitrijevikj, Ana
    Pantev, Risto
    Miller, Brandon
    COMPUTERS & SECURITY, 2014, 42 : 92 - 115
  • [39] Rough set-aided feature selection for automatic Web-page classification
    Wakaki, T
    Itakura, H
    Tamura, M
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 70 - 76
  • [40] Web Page Segmentation with Structured Prediction and its Application in Web Page Classification
    Bing, Lidong
    Guo, Rui
    Lam, Wai
    Niu, Zheng-Yu
    Wang, Haifeng
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 767 - 776