MSML: A Novel Multilevel Semi-Supervised Machine Learning Framework for Intrusion Detection System

被引:69
|
作者
Yao, Haipeng [1 ]
Fu, Danyang [1 ]
Zhang, Peiying [2 ,3 ]
Li, Maozhen [4 ]
Liu, Yunjie [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] China Univ Petr East China, Coll Comp & Commun Engn, Qingdao 266580, Shandong, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[4] Brunel Univ, Dept Elect & Comp Engn, Uxbridge UB8 3PH, Middx, England
来源
IEEE INTERNET OF THINGS JOURNAL | 2019年 / 6卷 / 02期
基金
中国国家自然科学基金;
关键词
Class imbalance; intrusion detection; nonidentical distribution; semi-supervised learning; unknown pattern discovery; FEATURE-SELECTION;
D O I
10.1109/JIOT.2018.2873125
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Intrusion detection technology has received increasing attention in recent years. Many researchers have proposed various intrusion detection systems using machine learning (ML) methods. However, there are two noteworthy factors affecting the robustness of the model. One is the severe imbalance of network traffic in different categories and the other is the nonidentical distribution between training set and test set in feature space. This paper presents a multilevel intrusion detection model framework named multilevel semi-supervised ML (MSML) to address these issues. The MSML framework includes four modules: 1) pure cluster extraction; 2) pattern discovery; 3) fine-grained classification (FC); and 4) model updating. In the pure cluster module, we introduce an concept of "pure cluster" and propose a hierarchical semi-supervised k-means algorithm with an aim to find out all the pure clusters. In the pattern discovery module, we define the "unknown pattern" and apply cluster-based method aiming to find those unknown patterns. Then a test sample is sentenced to labeled known pattern or unlabeled unknown pattern. The FC module can achieves FC for those unknown pattern samples. The model updating module provides a mechanism for retraining. KDDCUP99 dataset is applied to evaluate MSML. Experimental results show that MSML is superior to other existing intrusion detection models in terms of overall accuracy, F1-score, and unknown pattern recognition capability.
引用
收藏
页码:1949 / 1959
页数:11
相关论文
共 50 条
  • [1] Semi-supervised machine learning framework for network intrusion detection
    Jieling Li
    Hao Zhang
    Yanhua Liu
    Zhihuang Liu
    [J]. The Journal of Supercomputing, 2022, 78 : 13122 - 13144
  • [2] Semi-supervised machine learning framework for network intrusion detection
    Li, Jieling
    Zhang, Hao
    Liu, Yanhua
    Liu, Zhihuang
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (11): : 13122 - 13144
  • [3] A Novel Distributed Machine Learning Framework for Semi-Supervised Detection of Botnet Attacks
    Kaur, Gagandeep
    [J]. 2018 ELEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2018, : 233 - 239
  • [4] Effective Intrusion Detection System Using Semi-Supervised Learning
    Wagh, Sharmila Kishor
    Kolhe, Satish R.
    [J]. 2014 INTERNATIONAL CONFERENCE ON DATA MINING AND INTELLIGENT COMPUTING (ICDMIC), 2014,
  • [5] A semi-supervised learning model for intrusion detection
    Jiang, Eric P.
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2019, 13 (03): : 343 - 353
  • [6] Effective semi-supervised approach towards intrusion detection system using machine learning techniques
    Wagh, Sharmila Kishor
    Kolhe, Satish R.
    [J]. INTERNATIONAL JOURNAL OF ELECTRONIC SECURITY AND DIGITAL FORENSICS, 2015, 7 (03) : 290 - 304
  • [7] Predicting Unlabeled Traffic For Intrusion Detection Using Semi-Supervised Machine Learning
    Murthy, Chidananda P.
    Manjunatha, A. S.
    Jaiswal, Anku
    Madhu, B. R.
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2016, : 218 - 222
  • [8] Fuzziness based semi-supervised learning approach for intrusion detection system
    Ashfaq, Rana Aamir Raza
    Wang, Xi-Zhao
    Huang, Joshua Zhexue
    Abbas, Haider
    He, Yu-Lin
    [J]. INFORMATION SCIENCES, 2017, 378 : 484 - 497
  • [9] A semi-supervised interpretable machine learning framework for sensor fault detection
    Martakis, Panagiotis
    Movsessian, Artur
    Reuland, Yves
    Pai, Sai G. S.
    Quqa, Said
    Cava, David Garcia
    Tcherniak, Dmitri
    Chatzi, Eleni
    [J]. SMART STRUCTURES AND SYSTEMS, 2022, 29 (01) : 251 - 266
  • [10] Semi-Supervised Learning Methods for Network Intrusion Detection
    Chen, Chuanliang
    Gong, Yunchao
    Tian, Yingjie
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 2602 - +