Bot Classification for Real-Life Highly Class-Imbalanced Dataset

Cited by: 1
Authors
Harun, Sarah [1]
Bhuiyan, Tanveer Hossain [1]
Zhang, Song [1]
Medal, Hugh [1]
Bian, Linkan [1]
Affiliation
[1] Mississippi State Univ, Starkville, MS 39762 USA
Keywords
Bot detection; malware detection; feature extraction; imbalanced dataset; classification; network traffic
DOI
10.1109/DASC-PICom-DataCom-CyberSciTec.2017.102
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Botnets are networks formed by machines infected with malware called bots. Detecting these malicious networks is becoming a major concern because they pose a serious threat to network security. Most research on bot detection relies on particular botnet characteristics and therefore fails to detect other types of botnets and bots. Furthermore, very few bot detection methods have considered real-life class-imbalanced datasets. A dataset is class-imbalanced if one class contains significantly more instances than the other classes. In this paper, we develop three generic features to detect different types of bots regardless of their botnet characteristics. We develop five classification models based on those features to classify bots in a large, real-life, class-imbalanced network dataset. Results show that our methodology detects bots more accurately than existing methods. Experimental results also demonstrate that the methodology successfully detects bots when the proportion of bots to normal activity is very small. We also compare the performance of our methodology with that of a recent study on bot detection in a large, real-life, imbalanced dataset.
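The abstract's definition of class imbalance can be illustrated with a minimal sketch. It is not the authors' method: the labels, the 990-to-10 split, and the helper names `imbalance_ratio` and `precision_recall` are all hypothetical and chosen only to show why plain accuracy can mislead when bots are rare.

```python
from collections import Counter


def imbalance_ratio(labels):
    """Return the majority-class count divided by the minority-class count."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


def precision_recall(y_true, y_pred, positive="bot"):
    """Compute precision and recall for the positive (minority) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical toy data (an assumption, not from the paper):
# 990 normal hosts and 10 bots.
y_true = ["normal"] * 990 + ["bot"] * 10

# A trivial baseline that labels every host as normal.
y_pred = ["normal"] * len(y_true)

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

print("imbalance ratio:", imbalance_ratio(y_true))
print("accuracy:", accuracy)
print("precision, recall:", precision_recall(y_true, y_pred))
```

In this toy setting the baseline scores high accuracy while finding no bots at all, which is why minority-class metrics such as precision and recall are usually reported alongside accuracy for imbalanced data.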
Pages: 565 - 572
Number of pages: 8