Domain-Embeddings Based DGA Detection with Incremental Training Method

被引:0
|
作者
Fang, Xin [1 ]
Sun, Xiaoqing [1 ]
Yang, Jiahai [1 ]
Liu, Xinran [2 ]
机构
[1] Tsinghua Univ, Inst Network Sci & Cyberspace, Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[2] Natl Comp Network Emergency Response Tech Team Co, Beijing, Peoples R China
关键词
Domain-Embeddings; DGA Detection; Word2vec; Incremental Training;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
DGA-based botnet, which uses Domain Generation Algorithms (DGAs) to evade supervision, has become a part of the most destructive threats to network security. Over the past decades, a wealth of defense mechanisms focusing on domain features have emerged to address the problem. Nonetheless, DGA detection remains a daunting and challenging task due to the big data nature of Internet traffic and the potential fact that the linguistic features extracted only from the domain names are insufficient and the enemies could easily forge them to disturb detection. In this paper, we propose a novel DGA detection system which employs an incremental word-embeddings method to capture the interactions between end hosts and domains, characterize time-series patterns of DNS queries for each IP address and therefore explore temporal similarities between domains. We carefully modify the Word2Vec algorithm and leverage it to automatically learn dynamic and discriminative feature representations for over 1.9 million domains, and develop an simple classifier for distinguishing malicious domains from the benign. Given the ability to identify temporal patterns of domains and update models incrementally, the proposed scheme makes the progress towards adapting to the changing and evolving strategies of DGA domains. Our system is evaluated and compared with the state-of-art system FANCI and two deep-learning methods CNN and LSTM, with data from a large university's network named TUNET. The results suggest that our system outperforms the strong competitors by a large margin on multiple metrics and meanwhile achieves a remarkable speed-up on model updating.
引用
收藏
页码:185 / 190
页数:6
相关论文
共 50 条
  • [1] DeepD2V-Deep Learning and Domain Word Embeddings for DGA based Malware Detection
    Torrealba Aravena, Lucas
    Casas, Pedro
    Bustos-Jimenez, Javier
    Capdehourat, German
    Findrik, Mislav
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 164 - 170
  • [2] Helix: DGA Domain Embeddings for Tracking and Exploring Botnets
    Sidi, Lior
    Mirsky, Yisroel
    Nadler, Asaf
    Elovici, Yuval
    Shabtai, Asaf
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2741 - 2748
  • [3] Character Level based Detection of DGA Domain Names
    Yu, Bin
    Pan, Jie
    Hu, Jiaming
    Nascimento, Anderson
    De Cock, Martine
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [4] DOLPHIN: Phonics based Detection of DGA Domain Names
    Zhao, Dan
    Li, Hao
    Sun, Xiuwen
    Tang, Yazhe
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [5] A DGA Domain Name Detection Method Based on Two-Stage Feature Reinforcement
    Yang, Hongyu
    Zhang, Tao
    Hu, Ze
    Zhang, Liang
    Cheng, Xiang
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 652 - 659
  • [6] A DGA Domain Name Detection Method of Multilevel Feature Probability
    Yang, Hongyu
    Zhang, Tao
    Zhang, Liang
    Hu, Ze
    Xie, Lixia
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (05): : 86 - 91
  • [7] A DGA Domain Name Detection Method Based on Deep Learning Models with Mixed Word Embedding
    Du, Peng
    Ding, Shifei
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (02): : 433 - 446
  • [8] DGA domain name detection based on BiGRU-MCNN
    Chen, ChaoQuan
    Pan, LeiLei
    Xie, XiaoLan
    2019 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP 2019), 2019, : 316 - 320
  • [9] A Novel Detection Method for Word-Based DGA
    Yang, Luhui
    Liu, Guangjie
    Zhai, Jiangtao
    Dai, Yuewei
    Yan, Zhaozhi
    Zou, Yuguang
    Huang, Wenchao
    CLOUD COMPUTING AND SECURITY, PT II, 2018, 11064 : 472 - 483
  • [10] A DGA domain names detection modeling method based on integrating an attention mechanism and deep neural network
    Ren, Fangli
    Jiang, Zhengwei
    Wang, Xuren
    Liu, Jian
    CYBERSECURITY, 2020, 3 (01)