Improving Multiclass Classification of Cybersecurity Breaches in Railway Infrastructure using Imbalanced Learning

Times cited: 0
Authors
Nebaba, Aleksandr N. [1]
Savvas, Ilias K. [2]
Butakova, Maria A. [3]
Chernov, Andrey V. [3]
Shevchuk, Petr S. [4]
Affiliations
[1] Rostov State Transport Univ, Rostov Na Donu, Russia
[2] Univ Thessaly, Sch Technol, Dept Digital Syst, Larisa, Greece
[3] Southern Fed Univ, Smart Mat Res Inst, Rostov Na Donu, Russia
[4] Don State Tech Univ, Rostov Na Donu, Russia
Funding
Russian Foundation for Basic Research
Keywords
Multiclass classification; Machine learning; Imbalanced learning; Cybersecurity breaches; Railway infrastructure;
DOI
10.1145/3501774.3501789
Abstract
Machine learning approaches and algorithms are spreading across many areas of research and technology. Cybersecurity breaches are common anomalies in networked and distributed infrastructures, where they are carefully monitored, registered, and described. However, describing and classifying each security breach episode remains a difficult problem, especially in highly complex telecommunication infrastructure. Railway information infrastructure is usually large in scale and exposed to a wide diversity of possible security breaches. Today the registration of security breaches is mature and stable, but the problem of their automated classification is not completely solved. Many studies on multiclass classification of security breaches report inadequate classification accuracy. We investigated the origins of this problem and suggest that a possible root cause is imbalance in the datasets used for machine learning multiclass classification. We therefore propose an approach to improve classification accuracy and verify it on real collected datasets of cybersecurity breaches in railway telecommunication infrastructure. We analyzed the results of applying three imbalanced learning methodologies: random oversampling, the synthetic minority oversampling technique (SMOTE), and SMOTE with Tomek links. We implemented three machine learning algorithms, namely Naive Bayes, K-means, and support vector machine, on imbalanced and balanced data and compared the results to evaluate the imbalanced learning methodologies. The proposed approach increased the accuracy of multiclass classification by 30 to 41%, depending on the imbalanced learning technique.
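The first two balancing techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class labels, the toy feature vectors, and the function names random_oversample and smote_like are hypothetical, the SMOTE variant is simplified, and the Tomek-link cleaning step is omitted. It uses only the Python standard library.

```python
import random
from collections import Counter, defaultdict


def _group_by_class(X, y):
    """Collect the feature rows belonging to each class label."""
    by_class = defaultdict(list)
    for features, label in zip(X, y):
        by_class[label].append(features)
    return by_class


def random_oversample(X, y, seed=0):
    """Random oversampling: duplicate randomly chosen minority rows
    until every class is as large as the largest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(X, y)
    target = max(len(rows) for rows in by_class.values())
    X_out, y_out = list(X), list(y)
    for label, rows in by_class.items():
        for _ in range(target - len(rows)):
            X_out.append(rng.choice(rows))
            y_out.append(label)
    return X_out, y_out


def smote_like(X, y, k=2, seed=0):
    """Simplified SMOTE: create synthetic minority rows by interpolating
    between a row and one of its k nearest same-class neighbours."""
    rng = random.Random(seed)
    by_class = _group_by_class(X, y)
    target = max(len(rows) for rows in by_class.values())
    X_out, y_out = [list(row) for row in X], list(y)
    for label, rows in by_class.items():
        if len(rows) < 2:
            continue  # a single row has no neighbour to interpolate with
        for _ in range(target - len(rows)):
            base = rng.choice(rows)
            others = [r for r in rows if r is not base]
            # Squared Euclidean distance is enough for ranking neighbours.
            others.sort(key=lambda r: sum((a - b) ** 2 for a, b in zip(base, r)))
            neighbour = rng.choice(others[:k])
            gap = rng.random()  # position along the segment base -> neighbour
            X_out.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
            y_out.append(label)
    return X_out, y_out


# Hypothetical toy data: three breach classes with 6, 3, and 2 samples.
X = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.3], [0.3, 0.2], [0.2, 0.2], [0.0, 0.0],
     [5.0, 5.1], [5.2, 4.9], [4.8, 5.0],
     [9.0, 1.0], [9.5, 1.5]]
y = ["normal"] * 6 + ["scan"] * 3 + ["dos"] * 2

X_ros, y_ros = random_oversample(X, y)
X_sm, y_sm = smote_like(X, y)
print(sorted(Counter(y).items()))
print(sorted(Counter(y_ros).items()))
print(sorted(Counter(y_sm).items()))
```

Random oversampling only repeats existing rows, whereas the SMOTE-style variant places new rows on line segments between same-class neighbours, so the minority classes gain points that were not in the original data.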
Pages: 100-105
Number of pages: 6
Related papers
50 in total
  • [1] Active learning with extreme learning machine for online imbalanced multiclass classification
    Qin, Jiongming
    Wang, Cong
    Zou, Qinhong
    Sun, Yubin
    Chen, Bin
    KNOWLEDGE-BASED SYSTEMS, 2021, 231
  • [2] Robust Multiclass Classification for Learning from Imbalanced Biomedical Data
    Piyaphol Phoungphol
    Tsinghua Science and Technology, 2012, 17 (06) : 619 - 628
  • [3] Robust multiclass classification for learning from imbalanced biomedical data
    Phoungphol, Piyaphol
    Zhang, Yanqing
    Zhao, Yichuan
    Tsinghua Science and Technology, 2012, 17 (06) : 619 - 628
  • [4] Imbalanced multiclass classification with active learning in strip rolling process
    Deng, Jifei
    Sun, Jie
    Peng, Wen
    Zhang, Dianhua
    Vyatkin, Valeriy
    KNOWLEDGE-BASED SYSTEMS, 2022, 255
  • [5] Improving Imbalanced Dialogue Act Classification Using Cost-Sensitive Learning
    Miyagi, Takaaki
    Endo, Satoshi
    2022 JOINT 12TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 23RD INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS&ISIS), 2022,
  • [6] Improving Imbalanced Text Classification with Dynamic Curriculum Learning
    Zhang, Xulong
    Wang, Jianzong
    Cheng, Ning
    Xiao, Jing
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 1031 - 1036
  • [7] Improving Multiclass Classification in Crowdsourcing by Using Hierarchical Schemes
    Duan, Xiaoni
    Tajima, Keishi
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2694 - 2700
  • [8] Binary and Multiclass Imbalanced Classification Using Multi-Objective Ant Programming
    Luis Olmo, Juan
    Cano, Alberto
    Raul Romero, Jose
    Ventura, Sebastian
    2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 70 - 76
  • [9] Improving the Performance of Sentiment Classification on Imbalanced Datasets With Transfer Learning
    Xiao, Z.
    Wang, L.
    Du, J. Y.
    IEEE ACCESS, 2019, 7 : 28281 - 28290
  • [10] Multiclass imbalanced and concept drift network traffic classification framework based on online active learning
    Liu, Weike
    Zhu, Cheng
    Ding, Zhaoyun
    Zhang, Hang
    Liu, Qingbao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117