A Novel Self-Supervised Framework Based on Masked Autoencoder for Traffic Classification

被引:0
|
作者
Zhao, Ruijie [1 ]
Zhan, Mingwei [1 ]
Deng, Xianwen [1 ]
Li, Fangqi [1 ]
Wang, Yanhao [2 ]
Wang, Yijun [1 ]
Gui, Guan [3 ]
Xue, Zhi [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] QI ANXIN Technol Res Inst, Beijing 100015, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Peoples R China
关键词
Network security; traffic classification; self-supervised learning; NETWORK;
D O I
10.1109/TNET.2023.3335253
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic classification is a critical task in network security and management. Recent research has demonstrated the effectiveness of the deep learning-based traffic classification method. However, the following limitations remain: (1) the traffic representation is simply generated from raw packet bytes, resulting in the absence of important information; (2) the model structure of directly applying deep learning algorithms does not take traffic characteristics into account; and (3) scenario-specific classifier training usually requires a labor-intensive and time-consuming process to label data. In this paper, we introduce a masked autoencoder (MAE) based traffic transformer with multi-level flow representation to tackle these problems. To model raw traffic data, we design a formatted traffic representation matrix with hierarchical flow information. After that, we develop an efficient Traffic Transformer, in which packet-level and flow-level attention mechanisms implement more efficient feature extraction with lower complexity. At last, we utilize MAE paradigm to pre-train our classifier with a large amount of unlabeled data, and perform fine-tuning with a few labeled data for a series of traffic classification tasks. Experiment findings reveal that our method outperforms state-of-the-art methods on five real-world traffic datasets by a large margin. The code is available at https://github.com/NSSL-SJTU/YaTC.
引用
收藏
页码:2012 / 2025
页数:14
相关论文
共 50 条
  • [1] Self-Supervised Learning Malware Traffic Classification Based on Masked Autoencoder
    Xu, Ke
    Zhang, Xixi
    Wang, Yu
    Ohtsuki, Tomoaki
    Adebisi, Bamidele
    Sari, Hikmet
    Gui, Guan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 17330 - 17340
  • [2] A self-supervised learning framework based on masked autoencoder for complex wafer bin map classification
    Wang, Yi
    Ni, Dong
    Huang, Zhenyu
    Chen, Puyang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [3] Medical Image Classification Using Self-Supervised Learning-Based Masked Autoencoder
    Fan, Zong
    Wang, Zhimin
    Gong, Ping
    Lee, Christine U.
    Tang, Shanshan
    Zhang, Xiaohui
    Hao, Yao
    Zhang, Zhongwei
    Song, Pengfei
    Chen, Shigao
    Li, Hua
    [J]. MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [4] A Survey on Masked Autoencoder for Visual Self-supervised Learning
    Zhang, Chaoning
    Zhang, Chenshuang
    Song, Junha
    Yi, John Seon Keun
    Kweon, In So
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6805 - 6813
  • [5] ProteinMAE: masked autoencoder for protein surface self-supervised learning
    Yuan, Mingzhi
    Shen, Ao
    Fu, Kexue
    Guan, Jiaming
    Ma, Yingfan
    Qiao, Qin
    Wang, Manning
    [J]. BIOINFORMATICS, 2023, 39 (12)
  • [6] GMAEEG: A Self-Supervised Graph Masked Autoencoder for EEG Representation Learning
    Fu, Zanhao
    Zhu, Huaiyu
    Zhao, Yisheng
    Huan, Ruohong
    Zhang, Yi
    Chen, Shuohui
    Pan, Yun
    [J]. IEEE Journal of Biomedical and Health Informatics, 2024, 28 (11): : 6486 - 6497
  • [7] Self-Supervised Learning-Based Time Series Classification via Hierarchical Sparse Convolutional Masked-Autoencoder
    Yu, Ting
    Xu, Kele
    Wang, Xu
    Ding, Bo
    Feng, Dawei
    [J]. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 964 - 975
  • [8] GAF-MAE: A Self-Supervised Automatic Modulation Classification Method Based on Gramian Angular Field and Masked Autoencoder
    Shi, Yunhao
    Xu, Hua
    Zhang, Yue
    Qi, Zisen
    Wang, Dan
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (01) : 94 - 106
  • [9] Self-Supervised Learning for 3-D Point Clouds Based on a Masked Linear Autoencoder
    Yang, Hongxin
    Wang, Ruisheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 11
  • [10] Masked Autoencoder for Self-Supervised Pre-training on Lidar Point Clouds
    Hess, Georg
    Jaxing, Johan
    Svensson, Elias
    Hagerman, David
    Petersson, Christoffer
    Svensson, Lennart
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 350 - 359