Malicious Domain Names Detection Algorithm Based on Lexical Analysis and Feature Quantification

被引:0
|
作者
Zhao, Hong [1 ]
Chang, Zhaobin [1 ]
Wang, Weijie [1 ]
Zeng, Xiangyan [2 ]
机构
[1] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou 730050, Gansu, Peoples R China
[2] Ft Valley State Univ, Dept Math & Comp Sci, Ft Valley, GA 31030 USA
来源
IEEE ACCESS | 2019年 / 7卷
基金
美国国家科学基金会;
关键词
Malicious domain names; N-gram; domain name substring; edit distance; reputation value; DGA-BASED BOTNET; DNS; INTERNET; ALEXA;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Malicious domain names usually refer to a series of illegal activities, posing threats to people's privacy and property. Therefore, the problem of detecting malicious domain names has aroused widespread concerns. In this study, a malicious domain names detection algorithm based on lexical analysis and feature quantification is proposed. To achieve efficient and accurate detection, the method includes two phases. The first phase checks an observed domain name against a blacklist of known malicious uniform resource locator (URLs). The observed domain name is classified as being definitely malicious or potentially malicious based on its edit distances to the domain names on the blacklist. The second phase further evaluates a potential malicious domain name by its reputation value that represents its lexical feature and is calculated based on an N-gram model. The top 100,000 normal domain names in Alexa are used to obtain a whitelist substring set using the N-gram method in which each domain name excluding the top-level domain is segmented into substrings with the length of 3, 4, 5, 6 and 7. The weighted values of the substrings are calculated according to their occurrence counts in the whitelist substring set. A potential malicious domain name is segmented by the N-gram method and its reputation value is calculated based on the weighted values of its substrings. Finally, the potential malicious domain name is determined to be malicious or normal based on its reputation value. The effectiveness of the proposed detection method has been demonstrated by experiments on public available data.
引用
收藏
页码:128990 / 128999
页数:10
相关论文
共 50 条
  • [1] Malicious Domain Names Detection Algorithm Based on N-Gram
    Zhao, Hong
    Chang, Zhaobin
    Bao, Guangbin
    Zeng, Xiangyan
    [J]. JOURNAL OF COMPUTER NETWORKS AND COMMUNICATIONS, 2019, 2019
  • [2] Detection of malicious and abusive domain names
    Kidmose, Egon
    Lansing, Erwin
    Brandbyge, Soren
    Pedersen, Jens Myrup
    [J]. 2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018), 2018, : 49 - 56
  • [3] A review on lexical based malicious domain name detection methods
    Hamroun, Cherifa
    Amamou, Ahmed
    Haddadou, Kamel
    Haroun, Hayat
    Pujolle, Guy
    [J]. ANNALS OF TELECOMMUNICATIONS, 2024, 79 (7-8) : 457 - 473
  • [4] Improved Detection of Malicious Domain Names Using Gradient Boosted Machines and Feature Engineering
    Alhogail, Areej
    Al-Turaiki, Isra
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2022, 51 (02): : 313 - 331
  • [5] A Framework for Malicious Domain Names Detection Using Feature Selection and Majority Voting Approach
    Patil, Dharmaraj R.
    [J]. Informatica (Slovenia), 2024, 48 (03): : 419 - 438
  • [6] Detection of malicious domain names based on an improved hidden Markov model
    Tang H.
    Dong C.
    [J]. International Journal of Wireless and Mobile Computing, 2019, 16 (01): : 58 - 65
  • [7] Frequency Domain Feature Based Robust Malicious Traffic Detection
    Fu, Chuanpu
    Li, Qi
    Shen, Meng
    Xu, Ke
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (01) : 452 - 467
  • [8] Analysis of the Similarities in Malicious DNS Domain Names
    Lasota, Krzysztof
    Kozakiewicz, Adam
    [J]. SECURE AND TRUST COMPUTING, DATA MANAGEMENT, AND APPLICATIONS, 2011, 187 : 1 - 6
  • [9] Cardinality Analysis to Classify Malicious Domain Names
    Yoshida, Kenichi
    Fujiwara, Kazunori
    Sato, Akira
    Sannomiya, Shuji
    [J]. 2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 826 - 832
  • [10] Detecting Malicious Domain Names with Abnormal WHOIS Records Using Feature-Based Rules
    Cheng, Yanan
    Chai, Tingting
    Zhang, Zhaoxin
    Lu, Keyu
    Du, Yuejin
    [J]. COMPUTER JOURNAL, 2022, 65 (09): : 2262 - 2275