Detecting multi-stage attacks using sequence-to-sequence model

被引:13
|
作者
Zhou, Peng [1 ]
Zhou, Gongyan [1 ]
Wu, Dakui [1 ]
Fei, Minrui [1 ]
机构
[1] Shanghai Univ, Shanghai Key Lab Power Stn Automat Technol, Shanghai, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Multi-stage attack; Intrusion detection; Sequence-to-sequence model; Encoder-decoder architecture; Long-short term memory (LSTM) network;
D O I
10.1016/j.cose.2021.102203
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-stage attack is a kind of sophisticated intrusion strategy that has been widely used for penetrating the well protected network infrastructures. To detect such attacks, state-of-theart research advocates the use of hidden markov model (HMM). However, despite the HMM can model the relationships and dependencies among different alerts and stages for detection, they cannot handle well the stage dependencies buried in a longer sequence of alerts. In this paper, we tackle the challenge of the stages' long-term dependency and propose a new detection solution using a sequence-to-sequence (seq2seq) model. The basic idea is to encode a sequence of alerts (i.e., detector's observation) into a latent feature vector using a long-short term memory (LSTM) network and then decode this vector to a sequence of predicted attacking stages with another LSTM. By the encoder-decoder collaboration, we can decouple the local constraint between the observed alerts and the potential attacking stages, and thus able to take the full knowledge of all the alerts for the detection of stages in a sequence basis. By the LSTM, we can learn to "forget" irrelevant alerts and thereby have more opportunities to "remember" the long-term dependency between different stages for our sequence detection. To evaluate our model's effectiveness, we have conducted extensive experiments using four public datasets, all of which include simulated or re-constructed samples of real-world multi-stage attacks in controlled testbeds. Our results have successfully confirmed the better detection performance of our model compared with the previous HMM solutions. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Direct speech-to-speech translation with a sequence-to-sequence model
    Jia, Ye
    Weiss, Ron J.
    Biadsy, Fadi
    Macherey, Wolfgang
    Johnson, Melvin
    Chen, Zhifeng
    Wu, Yonghui
    INTERSPEECH 2019, 2019, : 1123 - 1127
  • [42] Intrusion Prediction With System-Call Sequence-to-Sequence Model
    Lv, Shaohua
    Wang, Jian
    Yang, Yinqi
    Liu, Jiqiang
    IEEE ACCESS, 2018, 6 : 71413 - 71421
  • [43] UnitNet: A Sequence-to-Sequence Acoustic Model for Concatenative Speech Synthesis
    Zhou, Xiao
    Ling, Zhen-Hua
    Dai, Li-Rong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2643 - 2655
  • [44] Decision Analysis of Sequence Multi-Stage IT Project under Uncertainty
    Zhang, Jian
    Jia, Suling
    Guo, Yanqin
    INTERNATIONAL CONFERENCE ON MANAGEMENT OF E-COMMERCE AND E-GOVERNMENT, PROCEEDINGS, 2008, : 419 - 424
  • [45] A Sequence-to-Sequence Model for Online Signal Detection and Format Recognition
    Cheng, Le
    Zhu, Hongna
    Hu, Zhengliang
    Luo, Bin
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 994 - 998
  • [46] Mandarin Prosody Boundary Prediction based on Sequence-to-sequence Model
    Yan, Yajing
    Jiang, Jiaolong
    Yang, Hongwu
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1013 - 1017
  • [47] Towards Sequence-to-Sequence Neural Model for Croatian Abstractive Summarization
    Davidovic, Vlatka
    Ipsic, Sanda Martincic
    CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS, CECIIS, 2023, : 309 - 315
  • [48] Graph augmented sequence-to-sequence model for neural question generation
    Hui Ma
    Jian Wang
    Hongfei Lin
    Bo Xu
    Applied Intelligence, 2023, 53 : 14628 - 14644
  • [49] A Clustering based Adaptive Sequence-to-Sequence Model for Dialogue Systems
    Ren, Da
    Cai, Yi
    Chan, Wai Hong
    Li, Zongxi
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 775 - 781
  • [50] CORRECTION OF AUTOMATIC SPEECH RECOGNITION WITH TRANSFORMER SEQUENCE-TO-SEQUENCE MODEL
    Hrinchuk, Oleksii
    Popova, Mariya
    Ginsburg, Boris
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7074 - 7078