A Machine Learning Framework for Domain Generation Algorithm-Based Malware Detection

被引:41
|
作者
Li, Yi [1 ]
Xiong, Kaiqi [1 ]
Chin, Tommy [2 ]
Hu, Chengbin [1 ]
机构
[1] Univ S Florida, Florida Ctr Cybersecur, Intelligent Comp Networking & Secur Lab, Tampa, FL 33620 USA
[2] Rochester Inst Technol, Dept Comp Secur, Rochester, NY 14623 USA
基金
美国国家科学基金会;
关键词
Malware; domain generation algorithm; machine learning; security; networking; BIG DATA;
D O I
10.1109/ACCESS.2019.2891588
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attackers usually use a command and control (C2) server to manipulate the communication. In order to perform an attack, threat actors often employ a domain generation algorithm (DGA), which can allow malware to communicate with C2 by generating a variety of network locations. Traditional malware control methods, such as blacklisting, are insufficient to handle DGA threats. In this paper, we propose a machine learning framework for identifying and detecting DGA domains to alleviate the threat. We collect real-time threat data from the real-life traffic over a one-year period. We also propose a deep learning model to classify a large number of DGA domains. The proposed machine learning framework consists of a two-level model and a prediction model. In the two-level model, we first classify the DGA domains apart from normal domains and then use the clustering method to identify the algorithms that generate those DGA domains. In the prediction model, a time-series model is constructed to predict incoming domain features based on the hidden Markov model (HMM). Furthermore, we build a deep neural network (DNN) model to enhance the proposed machine learning framework by handling the huge dataset we gradually collected. Our extensive experimental results demonstrate the accuracy of the proposed framework and the DNN model. To be precise, we achieve an accuracy of 95.89% for the classification in the framework and 97.79% in the DNN model, 92.45% for the second-level clustering, and 95.21% for the HMM prediction in the framework.
引用
收藏
页码:32765 / 32782
页数:18
相关论文
共 50 条
  • [1] A machine learning framework for domain generating algorithm based malware detection
    Akhila, G. P.
    Gayathri, R.
    Keerthana, S.
    Gladston, Angelin
    SECURITY AND PRIVACY, 2020, 3 (06):
  • [2] A Machine Learning Framework for Studying Domain Generation Algorithm (DGA)-Based Malware
    Chin, Tommy
    Xiong, Kaiqi
    Hu, Chengbin
    Li, Yi
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2018, PT I, 2018, 254 : 433 - 448
  • [3] AdamW plus : Machine Learning Framework to Detect Domain Generation Algorithms for Malware
    Javed, Awais
    Rashid, Imran
    Tahir, Shahzaib
    Saeed, Saqib
    Almuhaideb, Abdullah M.
    Alissa, Khalid
    IEEE ACCESS, 2024, 12 : 79138 - 79150
  • [4] Machine learning algorithm-based spam detection in social networks
    M. Sumathi
    S. P. Raja
    Social Network Analysis and Mining, 13
  • [5] Machine learning algorithm-based spam detection in social networks
    Sumathi, M.
    Raja, S. P.
    SOCIAL NETWORK ANALYSIS AND MINING, 2023, 13 (01)
  • [6] A Machine-Learning-Based Framework for Supporting Malware Detection and Analysis
    Cuzzocrea, Alfredo
    Mercaldo, Francesco
    Martinelli, Fabio
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT III, 2021, 12951 : 353 - 365
  • [7] Adaptive Machine learning: A Framework for Active Malware Detection
    Aslam, Muhammad
    Ye, Dengpan
    Hanif, Muhammad
    Asad, Muhammad
    2020 16TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2020), 2020, : 57 - 64
  • [8] An Adversarial Reinforcement Learning Framework for Robust Machine Learning-based Malware Detection
    Ebrahimi, Mohammadreza
    Li, Weifeng
    Chai, Yidong
    Pacheco, Jason
    Chen, Hsinchun
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 567 - 576
  • [9] Static analysis framework for permission-based dataset generation and android malware detection using machine learning
    Pathak, Amarjyoti
    Kumar, Th. Shanta
    Barman, Utpal
    EURASIP JOURNAL ON INFORMATION SECURITY, 2024, 2024 (01):
  • [10] Leveraging Classification and Detection of Malware: A Robust Machine Learning-Based Framework
    Sethi, Lingaraj
    Patra, Prashanta Kumar
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 299 - 306