A clustering-based approach for classifying data streams using graph matching

被引:0
|
作者
Du, Yuxin [1 ]
He, Mingshu [2 ]
Wang, Xiaojuan [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Cyberspace Secur, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Coarse-grained clustering; Traffic classification; Graph matching algorithm; Primary features; ENCRYPTED TRAFFIC CLASSIFICATION; NETWORK; SCHEME;
D O I
10.1186/s40537-025-01087-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In response to challenges such as data encryption, uneven distribution, and user privacy concerns in network traffic classification, this paper presents a clustering-based approach.In response to challenges such as data encryption, uneven distribution, and user privacy concerns in network traffic classification, this paper presents a clustering-based approach. The proposed method utilizes a graph matching approach to effectively categorize data streams in real-time scenarios. This approach aims to enhance the accuracy and efficiency of network traffic classification, particularly in the face of evolving encryption techniques and privacy-preserving measures. The method relies solely on non-content features to characterize network flow characteristics and employs graph matching algorithms to reduce inter-class imbalances, enabling coarse-grained clustering and reliable graph matching. Firstly, an unsupervised clustering framework is designed, which studies the diverse distributions and category similarities of traffic data based on a limited set of features. This unsupervised clustering helps mitigate network disparities by aggregating network sessions into a few clusters with extracted primary features. Next, the correlation between clusters from the same network is used to construct a similarity graph. Finally, a graph matching algorithm is proposed, which combines graph neural networks and graph matching networks to reveal reliable correspondences between different network relationships. This allows for associating clusters in the test network with clusters in the initial network, enabling the labeling of test clusters based on associated clusters in the training set. Simulation results demonstrate that the proposed method achieves an accuracy rate of 96.8%, which is significantly superior to existing approaches.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Clustering based approach for incomplete data streams processing
    Najib, Fatma M.
    Ismail, Rasha M.
    Badr, Nagwa L.
    Gharib, Tarek F.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (03) : 3213 - 3227
  • [22] Fuzzy Clustering-Based Task Allocation Approach Using Bipartite Graph in Cloud-Fog Environment
    Gad-Elrab, Ahmed A. A.
    Noaman, Amin Y.
    PROCEEDINGS OF THE 16TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS'19), 2019, : 454 - 463
  • [23] An Enhanced Motif Graph Clustering-Based Deep Learning Approach for Traffic Forecasting
    Zhang, Chenhan
    Zhang, Shuyu
    Yu, James J. Q.
    Yu, Shui
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [24] A Graph and Trace Clustering-based Approach for Abstracting Mined Business Process Models
    Sun, Yaguang
    Bauer, Bernhard
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1 (ICEIS), 2016, : 63 - 74
  • [25] Detecting Data Accuracy Issues in Textual Geographical Data by a Clustering-based Approach
    Pellegrino, Maria Angela
    Postiglione, Luca
    Scarano, Vittorio
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 208 - 212
  • [26] GenMatcher: A Generic Clustering-Based Arbitrary Matching Framework
    Wang, Ping
    Mchale, Luke
    Gratz, Paul, V
    Sprintson, Alex
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 15 (04)
  • [27] Approximating I/O data using radial basis functions:: A new clustering-based approach
    Awad, M
    Pomares, H
    Herrera, LJ
    González, J
    Guillén, A
    Rojas, F
    COMPUTATIONAL INTELLIGENCE AND BIOINSPIRED SYSTEMS, PROCEEDINGS, 2005, 3512 : 289 - 296
  • [28] Spectral clustering-based community detection using graph distance and node attributes
    Tang, Fengqin
    Wang, Chunning
    Su, Jinxia
    Wang, Yuanyuan
    COMPUTATIONAL STATISTICS, 2020, 35 (01) : 69 - 94
  • [29] Conference scheduling: A clustering-based approach
    Bulhoes, Teobaldo
    Correia, Rubens
    Subramanian, Anand
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 297 (01) : 15 - 26
  • [30] Clustering-based privacy preserving anonymity approach for table data sharing
    Piao, Chunhui
    Liu, Liping
    Shi, Yajuan
    Jiang, Xuehong
    Song, Ning
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2020, 11 (04) : 768 - 773