Detecting multi-stage attacks using sequence-to-sequence model

被引:13
|
作者
Zhou, Peng [1 ]
Zhou, Gongyan [1 ]
Wu, Dakui [1 ]
Fei, Minrui [1 ]
机构
[1] Shanghai Univ, Shanghai Key Lab Power Stn Automat Technol, Shanghai, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Multi-stage attack; Intrusion detection; Sequence-to-sequence model; Encoder-decoder architecture; Long-short term memory (LSTM) network;
D O I
10.1016/j.cose.2021.102203
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-stage attack is a kind of sophisticated intrusion strategy that has been widely used for penetrating the well protected network infrastructures. To detect such attacks, state-of-theart research advocates the use of hidden markov model (HMM). However, despite the HMM can model the relationships and dependencies among different alerts and stages for detection, they cannot handle well the stage dependencies buried in a longer sequence of alerts. In this paper, we tackle the challenge of the stages' long-term dependency and propose a new detection solution using a sequence-to-sequence (seq2seq) model. The basic idea is to encode a sequence of alerts (i.e., detector's observation) into a latent feature vector using a long-short term memory (LSTM) network and then decode this vector to a sequence of predicted attacking stages with another LSTM. By the encoder-decoder collaboration, we can decouple the local constraint between the observed alerts and the potential attacking stages, and thus able to take the full knowledge of all the alerts for the detection of stages in a sequence basis. By the LSTM, we can learn to "forget" irrelevant alerts and thereby have more opportunities to "remember" the long-term dependency between different stages for our sequence detection. To evaluate our model's effectiveness, we have conducted extensive experiments using four public datasets, all of which include simulated or re-constructed samples of real-world multi-stage attacks in controlled testbeds. Our results have successfully confirmed the better detection performance of our model compared with the previous HMM solutions. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [11] A Sequence-to-Sequence Model for Semantic Role Labeling
    Daza, Angel
    Frank, Anette
    REPRESENTATION LEARNING FOR NLP, 2018, : 207 - 216
  • [12] Document Ranking with a Pretrained Sequence-to-Sequence Model
    Nogueira, Rodrigo
    Jiang, Zhiying
    Pradeep, Ronak
    Lin, Jimmy
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 708 - 718
  • [13] Question Generation Using Sequence-to-Sequence Model with Semantic Role Labels
    Naeiji, Alireza
    An, Aijun
    Davoudi, Heidar
    Delpisheh, Marjan
    Alzghool, Muath
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2830 - 2842
  • [14] Architectures for Detecting Interleaved Multi-Stage Network Attacks Using Hidden Markov Models
    Shawly, Tawfeeq
    Elghariani, Ali
    Kobes, Jason
    Ghafoor, Arif
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (05) : 2316 - 2330
  • [15] Detecting insertion, substitution, and deletion errors in radiology reports using neural sequence-to-sequence models
    Zech, John
    Forde, Jessica
    Titano, Joseph J.
    Kaji, Deepak
    Costa, Anthony
    Oermann, Eric Karl
    ANNALS OF TRANSLATIONAL MEDICINE, 2019, 7 (11)
  • [16] MULTI-SCALE ALIGNMENT AND CONTEXTUAL HISTORY FOR ATTENTION MECHANISM IN SEQUENCE-TO-SEQUENCE MODEL
    Tjandra, Andros
    Sakti, Sakriani
    Nakamura, Satoshi
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 648 - 655
  • [17] AN ANALYSIS OF INCORPORATING AN EXTERNAL LANGUAGE MODEL INTO A SEQUENCE-TO-SEQUENCE MODEL
    Kannan, Anjuli
    Wu, Yonghui
    Nguyen, Patrick
    Sainath, Tara N.
    Chen, Zhifeng
    Prabhavalkar, Rohit
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5824 - 5828
  • [18] A sequence-to-sequence model for joint bridge response forecasting
    Bahrami, Omid
    Wang, Wentao
    Hou, Rui
    Lynch, Jerome P.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 203
  • [19] Estimating Power Consumption of Air-conditioners Using a Sequence-to-sequence Model
    Hwang, Inhwan
    Cho, Hyeonje
    Ji, Yunhu
    Kim, Huijung
    2019 IEEE 9TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2019, : 295 - 300
  • [20] A Sequence-to-Sequence Pronunciation Model for Bangla Speech Synthesis
    Ahmad, Arif
    Hussain, Mohammed Raihan
    Selim, Mohammad Reza
    Iqbal, Muhammed Zafar
    Rahman, Mohammad Shahidur
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,