SHAPE: A Simultaneous Header and Payload Encoding Model for Encrypted Traffic Classification

被引:5
|
作者
Dai, Jianbang [1 ]
Xu, Xiaolong [2 ]
Gao, Honghao [3 ]
Wang, Xinheng [4 ]
Xiao, Fu [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210023, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[3] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[4] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215123, Peoples R China
基金
中国国家自然科学基金;
关键词
Traffic classification; encrypted traffic; autoencoder; transformer; deep metric learning; NETWORK;
D O I
10.1109/TNSM.2022.3213758
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many end-to-end deep learning algorithms seeking to classify malicious traffic and encrypted traffic have been proposed in recent years. End-to-end deep learning algorithms require a large number of samples to train a model. However, it is hard for existing methods fully utilizing the heterogeneous multimodal input. To this end, we propose the SHAPE model (simultaneous header and payload encoding), which mainly consists of two autoencoders and a transformer layer, to improve model performance. The two auto encoders extract features from heterogeneous inputs-the statistical information of each packet and byte-form payloads-and convert them into a unified format; then, a lightweight Transformers layer further extracts the relationship hidden in simultaneous input. In particular, the autoencoder for payload feature extraction contains several depthwise separable residual convolution layers for efficient feature extraction and a token squeeze layer to reduce the computing overhead of the Transformers layer. Moreover, we train the SHAPE model using deep metric learning, which pulls samples with the same class label together and separates samples from different classes in the low-dimensional embedding space. Thus, the SHAPE model can naturally handle multitask classification, and its performance is approximately 5.43% better than the current SOTA on the traffic type classification of the ISCX-VPN2016 dataset, at the cost of 9.31 times the training time, and 1.45 times the inference time.
引用
收藏
页码:1993 / 2012
页数:20
相关论文
共 50 条
  • [31] EAPT: An encrypted traffic classification model via adversarial pre-trained transformers
    Zhan, Mingming
    Yang, Jin
    Jia, Dongqing
    Fu, Geyuan
    Computer Networks, 2025, 257
  • [32] Pattern Matching of Packet Payload for Network Traffic Classification
    Choi, Kwangjin
    Choi, Jun Kyun
    2006 THE JOINT INTERNATIONAL CONFERENCE ON OPTICAL INTERNET (COIN) AND NEXT GENERATION NETWORK (NGNCON), 2006, : 394 - 396
  • [33] A Survey of Payload-Based Traffic Classification Approaches
    Finsterbusch, Michael
    Richter, Chris
    Rocha, Eduardo
    Mueller, Jean-Alexander
    Haenssgen, Klaus
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (02): : 1135 - 1156
  • [34] Deep Learning Classification for Encrypted Botnet Traffic: Optimising Model Performance and Resource Utilisation
    Carr, Lucas
    Chavula, Josiah
    SOUTH AFRICAN COMPUTER SCIENCE AND INFORMATION SYSTEMS RESEARCH TRENDS, SAICSIT 2024, 2024, 2159 : 3 - 29
  • [35] Payload Signature Structure for Accurate Application Traffic Classification
    Goo, Young-Hoon
    Shim, Kyu-Seok
    Lee, Su-Kang
    Kim, Myung-Sup
    2016 18TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2016,
  • [36] HEDGE: Efficient Traffic Classification of Encrypted and Compressed Packets
    Casino, Fran
    Choo, Kim-Kwang Raymond
    Patsakis, Constantinos
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (11) : 2916 - 2926
  • [37] FedETC: Encrypted traffic classification based on federated learning
    Jin, Zhiping
    Duan, Ke
    Chen, Changhui
    He, Meirong
    Jiang, Shan
    Xue, Hanxiao
    HELIYON, 2024, 10 (16)
  • [38] Investigating Two Different Approaches for Encrypted Traffic Classification
    Alshammari, Riyad
    Zincir-Heywood, A. Nur
    SIXTH ANNUAL CONFERENCE ON PRIVACY, SECURITY AND TRUST, PROCEEDINGS, 2008, : 156 - 166
  • [39] User Behavior Classification in Encrypted Cloud Camera Traffic
    Wang, Jibao
    Cao, Zigang
    Kang, Cuicui
    Xiong, Gang
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [40] Packet header-based reweight-long short term memory (Rew-LSTM) method for encrypted network traffic classification
    Hou, Jiangang
    Li, Xin
    Xu, Hongji
    Wang, Chun
    Cui, Lizhen
    Liu, Zhi
    Hu, Changzhen
    COMPUTING, 2024, 106 (08) : 2875 - 2896