Modeling DNS Activities Based on Probabilistic Latent Semantic Analysis

被引:0
|
作者
Yuchi, Xuebiao [1 ]
Lee, Xiaodong [1 ]
Jin, Jian [1 ]
Yan, Baoping [1 ]
机构
[1] Chinese Acad Sci, China Internet Network Informat Ctr, Comp Network Informat Ctr, Beijing 100190, Peoples R China
关键词
Domain Name System; Probabilistic latent semantic analysis; Co-occurrence;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional Web usage mining techniques aim at discovering usage patterns from Web data at the page level, while little work is engaged in at some upper level. In this paper, we propose a novel approach to the characterization of Internet users' preference and interests at the domain name level. By summarizing Internet user's domain name access behaviors as the co-occurrences of users and targeting domain names, an aspect model is introduced to classify users and domain names into various groups according to their co-occurrences. Meanwhile, each group is characterized by extracting the property of characteristic users and domain names. Experimental results on real-world data sets show that our approach is effective in which some meaningful groups are identified. Thus, our approach could be used for detecting unusual behaviors on the Internet at the domain name level, which can alleviate the work of searching the joint space of users and domain names.
引用
收藏
页码:290 / 301
页数:12
相关论文
共 50 条
  • [1] Probabilistic latent semantic analysis
    Hofmann, T
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1999, : 289 - 296
  • [2] COMPARISON OF LATENT SEMANTIC ANALYSIS AND PROBABILISTIC LATENT SEMANTIC ANALYSIS FOR DOCUMENTS CLUSTERING
    Kuta, Marcin
    Kitowski, Jacek
    [J]. COMPUTING AND INFORMATICS, 2014, 33 (03) : 652 - 666
  • [3] Latent semantic indexing: A probabilistic analysis
    Papadimitriou, CH
    Raghavan, P
    Tamaki, H
    Vempala, S
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2000, 61 (02) : 217 - 235
  • [4] Dynamic Threshold Model Based Probabilistic Latent Semantic Analysis
    Wang, Yiming
    Ye, Yangdong
    Zhu, Zhenfeng
    [J]. 2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 424 - 429
  • [5] A web recommendation technique based on probabilistic latent semantic analysis
    Xu, GD
    Zhang, YC
    Zhou, XF
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 15 - 28
  • [6] Collaborative recommendation algorithm based on probabilistic matrix factorization in probabilistic latent semantic analysis
    Huang, Li
    Tan, Wenan
    Sun, Yong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (07) : 8711 - 8722
  • [7] Collaborative recommendation algorithm based on probabilistic matrix factorization in probabilistic latent semantic analysis
    Li Huang
    Wenan Tan
    Yong Sun
    [J]. Multimedia Tools and Applications, 2019, 78 : 8711 - 8722
  • [8] The Hierarchical Clustering Analysis of Hyperspectral Image Based on Probabilistic Latent Semantic Analysis
    Yi Wen-bin
    Shen Li
    Qi Yin-feng
    Tang Hong
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2011, 31 (09) : 2471 - 2475
  • [9] Unsupervised learning by probabilistic latent semantic analysis
    Hofmann, T
    [J]. MACHINE LEARNING, 2001, 42 (1-2) : 177 - 196
  • [10] Unsupervised Learning by Probabilistic Latent Semantic Analysis
    Thomas Hofmann
    [J]. Machine Learning, 2001, 42 : 177 - 196