A clustering-based approach for classifying data streams using graph matching

被引:0
|
作者
Du, Yuxin [1 ]
He, Mingshu [2 ]
Wang, Xiaojuan [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Cyberspace Secur, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Coarse-grained clustering; Traffic classification; Graph matching algorithm; Primary features; ENCRYPTED TRAFFIC CLASSIFICATION; NETWORK; SCHEME;
D O I
10.1186/s40537-025-01087-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In response to challenges such as data encryption, uneven distribution, and user privacy concerns in network traffic classification, this paper presents a clustering-based approach.In response to challenges such as data encryption, uneven distribution, and user privacy concerns in network traffic classification, this paper presents a clustering-based approach. The proposed method utilizes a graph matching approach to effectively categorize data streams in real-time scenarios. This approach aims to enhance the accuracy and efficiency of network traffic classification, particularly in the face of evolving encryption techniques and privacy-preserving measures. The method relies solely on non-content features to characterize network flow characteristics and employs graph matching algorithms to reduce inter-class imbalances, enabling coarse-grained clustering and reliable graph matching. Firstly, an unsupervised clustering framework is designed, which studies the diverse distributions and category similarities of traffic data based on a limited set of features. This unsupervised clustering helps mitigate network disparities by aggregating network sessions into a few clusters with extracted primary features. Next, the correlation between clusters from the same network is used to construct a similarity graph. Finally, a graph matching algorithm is proposed, which combines graph neural networks and graph matching networks to reveal reliable correspondences between different network relationships. This allows for associating clusters in the test network with clusters in the initial network, enabling the labeling of test clusters based on associated clusters in the training set. Simulation results demonstrate that the proposed method achieves an accuracy rate of 96.8%, which is significantly superior to existing approaches.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] A Hybrid Approach for Clustering-based Data Aggregation in Wireless Sensor Networks
    Jung, Woo-Sung
    Lim, Keun-Woo
    Ko, Young-Bae
    Park, Sang-Joon
    THIRD INTERNATIONAL CONFERENCE ON DIGITAL SOCIETY: ICDS 2009, PROCEEDINGS, 2009, : 112 - 117
  • [32] A clustering-based approach to vortex extraction
    Deng, Liang
    Wang, Yueqing
    Chen, Cheng
    Liu, Yang
    Wang, Fang
    Liu, Jie
    JOURNAL OF VISUALIZATION, 2020, 23 (03) : 459 - 474
  • [33] ICN clustering-based approach for VANETs
    Lamia Chaari Fourati
    Samiha Ayed
    Mohamed Ali Ben Rejeb
    Annals of Telecommunications, 2021, 76 : 745 - 757
  • [34] ICN clustering-based approach for VANETs
    Fourati, Lamia Chaari
    Ayed, Samiha
    Ben Rejeb, Mohamed Ali
    ANNALS OF TELECOMMUNICATIONS, 2021, 76 (9-10) : 745 - 757
  • [35] A Clustering-Based Approach to Ontology Alignment
    Duan, Songyun
    Fokoue, Achille
    Srinivas, Kavitha
    Byrne, Brian
    SEMANTIC WEB - ISWC 2011, PT I, 2011, 7031 : 146 - +
  • [36] Spectral clustering-based community detection using graph distance and node attributes
    Fengqin Tang
    Chunning Wang
    Jinxia Su
    Yuanyuan Wang
    Computational Statistics, 2020, 35 : 69 - 94
  • [37] ClubCF: A Clustering-Based Collaborative Filtering Approach for Big Data Application
    Hu, Rong
    Dou, Wanchun
    Liu, Jianxun
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (03) : 302 - 313
  • [38] A Novel Approach for Clustering Data Streams Using Granularity Technique
    Kaneriya, Ankur
    Shukla, Madhu
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND APPLICATIONS (ICACEA), 2015, : 586 - 590
  • [39] A clustering-based approach to vortex extraction
    Liang Deng
    Yueqing Wang
    Cheng Chen
    Yang Liu
    Fang Wang
    Jie Liu
    Journal of Visualization, 2020, 23 : 459 - 474
  • [40] Clustering-based privacy preserving anonymity approach for table data sharing
    Chunhui Piao
    Liping Liu
    Yajuan Shi
    Xuehong Jiang
    Ning Song
    International Journal of System Assurance Engineering and Management, 2020, 11 : 768 - 773