A Machine Learning Framework for Domain Generation Algorithm-Based Malware Detection

被引:41
|
作者
Li, Yi [1 ]
Xiong, Kaiqi [1 ]
Chin, Tommy [2 ]
Hu, Chengbin [1 ]
机构
[1] Univ S Florida, Florida Ctr Cybersecur, Intelligent Comp Networking & Secur Lab, Tampa, FL 33620 USA
[2] Rochester Inst Technol, Dept Comp Secur, Rochester, NY 14623 USA
基金
美国国家科学基金会;
关键词
Malware; domain generation algorithm; machine learning; security; networking; BIG DATA;
D O I
10.1109/ACCESS.2019.2891588
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attackers usually use a command and control (C2) server to manipulate the communication. In order to perform an attack, threat actors often employ a domain generation algorithm (DGA), which can allow malware to communicate with C2 by generating a variety of network locations. Traditional malware control methods, such as blacklisting, are insufficient to handle DGA threats. In this paper, we propose a machine learning framework for identifying and detecting DGA domains to alleviate the threat. We collect real-time threat data from the real-life traffic over a one-year period. We also propose a deep learning model to classify a large number of DGA domains. The proposed machine learning framework consists of a two-level model and a prediction model. In the two-level model, we first classify the DGA domains apart from normal domains and then use the clustering method to identify the algorithms that generate those DGA domains. In the prediction model, a time-series model is constructed to predict incoming domain features based on the hidden Markov model (HMM). Furthermore, we build a deep neural network (DNN) model to enhance the proposed machine learning framework by handling the huge dataset we gradually collected. Our extensive experimental results demonstrate the accuracy of the proposed framework and the DNN model. To be precise, we achieve an accuracy of 95.89% for the classification in the framework and 97.79% in the DNN model, 92.45% for the second-level clustering, and 95.21% for the HMM prediction in the framework.
引用
收藏
页码:32765 / 32782
页数:18
相关论文
共 50 条
  • [21] Android Malware Detection Using Machine Learning with Feature Selection Based on the Genetic Algorithm
    Lee, Jaehyeong
    Jang, Hyuk
    Ha, Sungmin
    Yoon, Yourim
    MATHEMATICS, 2021, 9 (21)
  • [22] Automated machine learning for deep learning based malware detection
    Brown, Austin
    Gupta, Maanak
    Abdelsalam, Mahmoud
    COMPUTERS & SECURITY, 2024, 137
  • [23] Advanced Machine Learning Based Malware Detection Systems
    Kim, Song-Kyoo
    Feng, Xiaomei
    Al Hamadi, Hussam
    Damiani, Ernesto
    Yeun, Chan Yeob
    Nandyala, Sivaprasad
    IEEE ACCESS, 2024, 12 : 115296 - 115305
  • [24] Machine Learning Based Improved Malware Detection Schemes
    Priyadarshan, Pradosh
    Sarangi, Prateek
    Ratht, Adyasha
    Rath, Adyasha
    Panda, Ganapati
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 925 - 931
  • [25] An Android Malware Detection System Based on Machine Learning
    Wen, Long
    Yu, Haiyang
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [26] A Survey of Malware Detection Techniques based on Machine Learning
    El Merabet, Hoda
    Hajraoui, Abderrahmane
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 366 - 373
  • [27] On the Robustness of Machine Learning Based Malware Detection Algorithms
    Hu, Weiwei
    Tan, Ying
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1435 - 1441
  • [28] AN ASSOCIATIVE ARCHITECTURE FOR GENETIC ALGORITHM-BASED MACHINE LEARNING
    TWARDOWSKI, K
    COMPUTER, 1994, 27 (11) : 27 - 38
  • [29] MLDroid—framework for Android malware detection using machine learning techniques
    Arvind Mahindru
    A. L. Sangal
    Neural Computing and Applications, 2021, 33 : 5183 - 5240
  • [30] A Graph-Based Feature Generation Approach in Android Malware Detection with Machine Learning Techniques
    Liu, Xiaojian
    Lei, Qian
    Liu, Kehong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)