Text-based Malicious Domain Names Detection Based on Variational Autoencoder And Supervised Learning

被引:0
|
作者
Sun, Yuwei [1 ]
Chong, Ng S. T. [2 ]
Ochiai, Hideya [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] United Nations Univ, Campus Comp Ctr, Tokyo, Japan
关键词
malicious domain names detection; VAE; cybersecurity; machine learning;
D O I
10.1109/CISS48834.2020.1570601577
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of information technology, adaptation of an information system in industries and institutes has become more and more common. However, attacks like using zombie networks to access a host thus causing it to shut down are frequent in recent years. Domain names play a significant role in the connection with a server, considered as a key for detecting these attacks. In this paper, we propose a text-based method to convert domain names into numeric features, based on the term frequency and inverse document frequency (TF-IDF). Then we adopt the variational autoencoder (VAE) consisting of an encoder and a decoder, extracting hidden information from features. Moreover, through collapsing the Gaussian distribution of these features at the hidden layer to its mean, the distribution of domain names is visualized. After that, we adopt a supervised learning called Convolutional Neural Network (CNN) for the classification between the malicious and benign. We train the model using feature vectors from the VAE. At last, the scheme achieves a validation accuracy of 0.868 for the malicious domain names detection.
引用
收藏
页码:192 / 196
页数:5
相关论文
共 50 条
  • [41] Detection and Identification of Text-based Traffic Signs
    Chi, Xiuyuan
    Luo, Dean
    Liang, Qice
    Yang, Junxing
    Huang, He
    [J]. SENSORS AND MATERIALS, 2023, 35 (01) : 153 - 165
  • [42] Explaining poor performance of text-based machine learning models for vulnerability detection
    Napier, Kollin
    Bhowmik, Tanmay
    Chen, Zhiqian
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (05)
  • [43] A Method for Constructing Supervised Time Topic Model Based on Variational Autoencoder
    Gou, Zhinan
    Li, Yan
    Huo, Zheng
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [44] A self-supervised learning model based on variational autoencoder for limited-sample mammogram classification
    Karagoz, Meryem Altin
    Nalbantoglu, O. Ufuk
    [J]. APPLIED INTELLIGENCE, 2024, 54 (04) : 3448 - 3463
  • [45] A self-supervised learning model based on variational autoencoder for limited-sample mammogram classification
    Meryem Altin Karagoz
    O. Ufuk Nalbantoglu
    [J]. Applied Intelligence, 2024, 54 : 3448 - 3463
  • [46] A Semi-Supervised Learning Method for MiRNA-Disease Association Prediction Based on Variational Autoencoder
    Ji, Cunmei
    Wang, Yutian
    Gao, Zhen
    Li, Lei
    Ni, Jiancheng
    Zheng, Chunhou
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (04) : 2049 - 2059
  • [47] DISENTANGLED SPEECH REPRESENTATION LEARNING BASED ON FACTORIZED HIERARCHICAL VARIATIONAL AUTOENCODER WITH SELF-SUPERVISED OBJECTIVE
    Xie, Yuying
    Arildsen, Thomas
    Tan, Zheng-Hua
    [J]. 2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [48] INTERACTIVE EFFECTS OF TEXT-BASED AND TASK-BASED IMPORTANCE ON LEARNING FROM TEXT
    SCHRAW, G
    WADE, SE
    KARDASH, CAM
    [J]. JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1993, 85 (04) : 652 - 661
  • [49] An Open Relation Extraction Method for Domain Text Based on Hybrid Supervised Learning
    Wang, Xiaoxiong
    Hu, Jianpeng
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [50] Semi-supervised visual anomaly detection based on convolutional autoencoder and transfer learning
    Saeedi, Jamal
    Giusti, Alessandro
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2023, 11