Improving Multiclass Classification of Cybersecurity Breaches in Railway Infrastructure using Imbalanced Learning

Cited by: 0

Authors:

Nebaba, Aleksandr N. [1]

Savvas, Ilias K. [2]

Butakova, Maria A. [3]

Chernov, Andrey V. [3]

Shevchuk, Petr S. [4]

Affiliations:

[1] Rostov State Transport Univ, Rostov Na Donu, Russia

[2] Univ Thessaly, Sch Technol, Dept Digital Syst, Larisa, Greece

[3] Southern Fed Univ, Smart Mat Res Inst, Rostov Na Donu, Russia

[4] Don State Tech Univ, Rostov Na Donu, Russia

Source:

ESSE 2021: THE 2ND EUROPEAN SYMPOSIUM ON SOFTWARE ENGINEERING | 2021

Funding:

Russian Foundation for Basic Research

Keywords:

Multiclass classification; Machine learning; Imbalanced learning; Cybersecurity breaches; Railway infrastructure

DOI:

10.1145/3501774.3501789

Abstract:

Machine learning approaches and algorithms are spreading across many areas of research and technology. Cybersecurity breaches are common anomalies in networked and distributed infrastructures, where they are carefully monitored, registered, and described. However, describing and classifying each security breach episode remains a difficult problem, especially in highly complex telecommunication infrastructure. Railway information infrastructure is usually large in scale and exposed to a wide diversity of possible security breaches. At present, the registration of security breaches is mature and stable, but the problem of their automated classification is not completely solved. Many studies on multiclass classification of security breaches report inadequate classification accuracy. We investigated the origins of this problem and suggest that a possible root cause is imbalance in the datasets used for machine learning multiclass classification. We therefore propose an approach to improve classification accuracy and verify it on real collected datasets of cybersecurity breaches in railway telecommunication infrastructure. We analyzed the results of applying three imbalanced learning methodologies: random oversampling, the synthetic minority oversampling technique (SMOTE), and SMOTE combined with Tomek links. We implemented three machine learning algorithms, namely Naive Bayes, K-means, and support vector machine, on imbalanced and balanced data, and compared the results to evaluate the imbalanced learning methodologies. The proposed approach demonstrated an increase in multiclass classification accuracy in the range of 30 to 41%, depending on the imbalanced learning technique.
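To illustrate the simplest of the three balancing techniques named in the abstract, the following is a minimal sketch of random oversampling using only the Python standard library. It is not the authors' implementation: the function name `random_oversample`, the toy feature vectors, and the class labels ("malware", "phishing", "dos") are hypothetical and chosen only for illustration.

```python
import random
from collections import Counter


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen minority-class samples until every class
    has as many samples as the largest class (hypothetical helper)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())

    # Group sample indices by class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    # Start from the original data and append the sampled duplicates.
    out_samples, out_labels = list(samples), list(labels)
    for label, indices in by_class.items():
        # Draw with replacement to fill the gap to the majority-class size.
        for _ in range(target - len(indices)):
            pick = rng.choice(indices)
            out_samples.append(samples[pick])
            out_labels.append(label)
    return out_samples, out_labels


# Hypothetical toy data: three breach classes with a 6:3:1 imbalance.
X = [[i, i % 3] for i in range(10)]
y = ["malware"] * 6 + ["phishing"] * 3 + ["dos"] * 1

X_bal, y_bal = random_oversample(X, y)
print(Counter(y_bal))
```

In a typical workflow, balancing of this kind is applied only to the training split so that duplicated samples do not leak into the evaluation data. SMOTE and its combination with Tomek links go further by synthesizing new minority samples and removing borderline pairs; the imbalanced-learn library offers these as `SMOTE` and `SMOTETomek`, though the abstract does not state which implementation was used.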

Pages: 100-105

Page count: 6

