SHAPE: A Simultaneous Header and Payload Encoding Model for Encrypted Traffic Classification

被引:5
|
作者
Dai, Jianbang [1 ]
Xu, Xiaolong [2 ]
Gao, Honghao [3 ]
Wang, Xinheng [4 ]
Xiao, Fu [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210023, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[3] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[4] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215123, Peoples R China
基金
中国国家自然科学基金;
关键词
Traffic classification; encrypted traffic; autoencoder; transformer; deep metric learning; NETWORK;
D O I
10.1109/TNSM.2022.3213758
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many end-to-end deep learning algorithms seeking to classify malicious traffic and encrypted traffic have been proposed in recent years. End-to-end deep learning algorithms require a large number of samples to train a model. However, it is hard for existing methods fully utilizing the heterogeneous multimodal input. To this end, we propose the SHAPE model (simultaneous header and payload encoding), which mainly consists of two autoencoders and a transformer layer, to improve model performance. The two auto encoders extract features from heterogeneous inputs-the statistical information of each packet and byte-form payloads-and convert them into a unified format; then, a lightweight Transformers layer further extracts the relationship hidden in simultaneous input. In particular, the autoencoder for payload feature extraction contains several depthwise separable residual convolution layers for efficient feature extraction and a token squeeze layer to reduce the computing overhead of the Transformers layer. Moreover, we train the SHAPE model using deep metric learning, which pulls samples with the same class label together and separates samples from different classes in the low-dimensional embedding space. Thus, the SHAPE model can naturally handle multitask classification, and its performance is approximately 5.43% better than the current SOTA on the traffic type classification of the ISCX-VPN2016 dataset, at the cost of 9.31 times the training time, and 1.45 times the inference time.
引用
收藏
页码:1993 / 2012
页数:20
相关论文
共 50 条
  • [1] Payload Encoding Representation from Transformer for Encrypted Traffic Classification
    HE Hongye
    YANG Zhiguo
    CHEN Xiangning
    ZTE Communications, 2021, 19 (04) : 90 - 97
  • [2] Tor Traffic Classification Based on Encrypted Payload Characteristics
    Choorod, Pitpimon
    Weir, George
    2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, : 1107 - +
  • [3] Only Header: a reliable encrypted traffic classification framework without privacy risk
    Susu Cui
    Jian Liu
    Cong Dong
    Zhigang Lu
    Dan Du
    Soft Computing, 2022, 26 : 13391 - 13403
  • [4] Only Header: a reliable encrypted traffic classification framework without privacy risk
    Cui, Susu
    Liu, Jian
    Dong, Cong
    Lu, Zhigang
    Du, Dan
    SOFT COMPUTING, 2022, 26 (24) : 13391 - 13403
  • [5] Encrypted Voice Traffic Fingerprinting: An Adaptive Network Traffic Feature Encoding Model
    Zhou, Tianci
    Zeng, Yong
    Chen, Yitao
    Liu, Zhihong
    Ma, Jianfeng
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3768 - 3773
  • [6] A Survey on Encrypted Traffic Classification
    Cao, Zigang
    Xiong, Gang
    Zhao, Yong
    Li, Zhenzhen
    Guo, Li
    APPLICATIONS AND TECHNIQUES IN INFORMATION SECURITY, ATIS 2014, 2014, 490 : 73 - 81
  • [7] Realtime Classification for Encrypted Traffic
    Bar-Yanai, Roni
    Langberg, Michael
    Peleg, David
    Roditty, Liam
    EXPERIMENTAL ALGORITHMS, PROCEEDINGS, 2010, 6049 : 373 - +
  • [8] Encrypted Network Traffic Classification Using a Geometric Learning Model
    Huoh, Ting-Li
    Luo, Yan
    Zhang, Tong
    2021 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2021), 2021, : 376 - 383
  • [9] Classifying Tor Traffic Encrypted Payload Using Machine Learning
    Choorod, Pitpimon
    Weir, George
    Fernando, Anil
    IEEE ACCESS, 2024, 12 : 19418 - 19431
  • [10] Is Encrypted ClientHello a Challenge for Traffic Classification?
    Shamsimukhametov, Danil
    Kurapov, Anton
    Liubogoshchev, Mikhail
    Khorov, Evgeny
    IEEE ACCESS, 2022, 10 : 77883 - 77897