Symmetrization and overfitting in probabilistic latent semantic analysis

被引:1
|
作者
Leksin V.A. [1 ]
机构
[1] Moscow Institute of Physics and Technology, Dolgoprudnyi, Moscow oblast 141700
基金
俄罗斯基础研究基金会;
关键词
Collaborative filtering; Customer environment analysis; Latent profiles; Overfitting; Probabilistic latent semantic analysis; Symmetric models;
D O I
10.1134/S1054661809040014
中图分类号
学科分类号
摘要
An algorithm is proposed for revealing latent user's interests from the observable protocol of users behavior, e.g., site visits. The algorithm combines the ideas of customer environment analysis and probabilistic latent semantic analysis. A quality criterion based on the classification of preliminarily labeled sites is introduced to optimize the algorithm parameters and compare algorithms. The experiments show that the quality has an optimum by the essential parameters of the algorithm, however the attempt of too precise optimization can lead to overfitting. © 2009 Pleiades Publishing, Ltd.
引用
收藏
页码:565 / 574
页数:9
相关论文
共 50 条
  • [21] Incremental Probabilistic Latent Semantic Analysis for Automatic Question Recommendation
    Wu, Hu
    Wang, Yongji
    Cheng, Xiang
    RECSYS'08: PROCEEDINGS OF THE 2008 ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2008, : 99 - 106
  • [22] Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and Insights
    Figuera, Pau
    Bringas, Pablo Garcia
    TECHNOLOGIES, 2024, 12 (01)
  • [23] Using probabilistic latent semantic analysis for personalized web search
    Lin, CX
    Xue, GR
    Zeng, HJ
    Yu, Y
    WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 707 - 717
  • [24] Learning Similarity with Probabilistic Latent Semantic Analysis for Image Retrieval
    Li, Xiong
    Lv, Qi
    Huang, Wenting
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2015, 9 (04): : 1424 - 1440
  • [25] Online belief propagation algorithm for probabilistic latent semantic analysis
    Ye, Yun
    Gong, Shengrong
    Liu, Chunping
    Zeng, Jia
    Jia, Ning
    Zhang, Yi
    FRONTIERS OF COMPUTER SCIENCE, 2013, 7 (04) : 526 - 535
  • [26] Using Probabilistic latent semantic analysis for web page grouping
    Xu, GD
    Zhang, YC
    Zhou, XF
    15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications, Proceedings, 2005, : 29 - 36
  • [27] Online belief propagation algorithm for probabilistic latent semantic analysis
    Yun Ye
    Shengrong Gong
    Chunping Liu
    Jia Zeng
    Ning Jia
    Yi Zhang
    Frontiers of Computer Science, 2013, 7 : 526 - 535
  • [28] Modeling DNS Activities Based on Probabilistic Latent Semantic Analysis
    Yuchi, Xuebiao
    Lee, Xiaodong
    Jin, Jian
    Yan, Baoping
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 290 - 301
  • [29] RPLSA: A novel updating scheme for Probabilistic Latent Semantic Analysis
    Bassiou, N.
    Kotropoulos, C.
    COMPUTER SPEECH AND LANGUAGE, 2011, 25 (04): : 741 - 760
  • [30] A Parallel Probabilistic Latent Semantic Analysis Method on MapReduce Platform
    Liang, Zhao
    Li, Wenye
    Li, Yuxi
    2013 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2013, : 1017 - 1022