A Tag-based Improved LDA and Web Page Clustering Analysis

Cited by: 2
Authors
Chen, Fang [1]
Zhou, Yanhui [1]
Affiliations
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
Keywords
Web tags; LDA model; Text clustering;
DOI
10.4028/www.scientific.net/AMM.667.277
Chinese Library Classification (CLC) number
TH [Machinery and instrument industry];
Discipline classification code
0802;
Abstract
With the rapid development of the Internet, tagging has become widely used across websites. Short text labels attached to network resources make it much easier for people to access massive amounts of data. Social tagging allows users to tag network objects with any word and to share those tags; because it is simple and flexible, it has become a popular application. However, problems remain, such as noisy tags, a lack of usage criteria, and sparse distribution. Tag sparsity in particular seriously limits the use of tags in the semantic analysis of web pages. This paper applies a user-related tag expansion method to overcome this problem and uses the LDA topic model to model web tags, mine latent topics from large-scale web pages, and obtain topic distributions of the text for text clustering analysis. The experimental results show that, compared with traditional clustering algorithms, LDA-based clustering on web tags achieves a considerable improvement.
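The pipeline outlined in the abstract (expand sparse tags, fit LDA on the tags, cluster pages by topic distribution) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the expansion rule, so `expand_tags` is a hypothetical stand-in that adds each tagger's most frequent tags to the pages they tagged; `lda_gibbs` is a plain collapsed Gibbs sampler; and assigning each page to its dominant topic is a simplification of the clustering step. The sample annotations, function names, and hyperparameters are all assumptions.

```python
import random
from collections import Counter, defaultdict

def expand_tags(annotations, top_n=2):
    """Hypothetical expansion: add each tagger's top_n tags to pages they tagged."""
    by_user = defaultdict(Counter)   # user -> tag frequencies
    pages = defaultdict(list)        # page -> tag list (the "document")
    users_of = defaultdict(set)      # page -> users who tagged it
    for user, page, tag in annotations:
        by_user[user][tag] += 1
        pages[page].append(tag)
        users_of[page].add(user)
    for page, users in users_of.items():
        for user in sorted(users):
            pages[page].extend(t for t, _ in by_user[user].most_common(top_n))
    return dict(pages)

def lda_gibbs(docs, n_topics, alpha=0.1, beta=0.01, n_iter=200, seed=0):
    """Collapsed Gibbs sampling for LDA; returns per-document topic distributions."""
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [Counter() for _ in range(n_topics)]
    topic_total = [0] * n_topics
    assign = []
    # Random initial topic assignment for every tag occurrence.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(n_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assign.append(zs)
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                z = assign[d][i]
                # Remove the current assignment, then resample it.
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    # Smoothed topic proportions per document.
    return [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in counts]
        for counts, doc in zip(doc_topic, docs)
    ]

# Toy (user, page, tag) annotations, invented for illustration only.
annotations = [
    ("u1", "page_a", "python"), ("u1", "page_a", "code"),
    ("u1", "page_b", "python"), ("u2", "page_b", "tutorial"),
    ("u3", "page_c", "recipe"), ("u3", "page_c", "cooking"),
    ("u3", "page_d", "cooking"), ("u4", "page_d", "food"),
]
n_topics = 2
expanded = expand_tags(annotations)
page_ids = sorted(expanded)
theta = lda_gibbs([expanded[p] for p in page_ids], n_topics)
# Simplified clustering: each page joins the cluster of its dominant topic.
clusters = {
    p: max(range(n_topics), key=lambda k: row[k])
    for p, row in zip(page_ids, theta)
}
print(clusters)
```

In practice the topic vectors in `theta` could instead be passed to a standard clustering algorithm such as k-means; the dominant-topic rule is used here only to keep the sketch dependency-free.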
Pages: 277-285
Page count: 9
Related papers
50 in total
  • [1] Improved Search in Tag-Based Systems
    Awawdeh, Ruba
    Anderson, Terry
    [J]. 2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 288 - 293
  • [2] Spectral Clustering of Web Services by Fusing Document-based and Tag-based Topics Similarity
    Deng, Liping
    Zheng, Wen
    [J]. 2020 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2020), 2020, : 107 - 112
  • [3] Tag-based Web Photo Retrieval Improved by Batch Mode Re-Tagging
    Chen, Lin
    Xu, Dong
    Tsang, Ivor W.
    Luo, Jiebo
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3440 - 3446
  • [4] Web Service Clustering Approach Based on Network and Fused Document-Based and Tag-Based Topics Similarity
    Ping, Deng Li
    Bing, Guo
    Wen, Zheng
    [J]. INTERNATIONAL JOURNAL OF WEB SERVICES RESEARCH, 2021, 18 (03) : 63 - 81
  • [5] Synthesis of Collective Tag-Based Opinions in the Social Web
    Cena, Federica
    Likavec, Silvia
    Lombardi, Ilaria
    Picardi, Claudia
    [J]. AI(STAR)IA 2011: ARTIFICIAL INTELLIGENCE AROUND MAN AND BEYOND, 2011, 6934 : 286 - 298
  • [6] Tag-based Analysis at the BESIII Experiment
    Deng, Z. Y.
    Zou, J. H.
    Sun, S. S.
    Liu, B. J.
    Wang, L.
    Shi, J. Y.
    Xiong, X. A.
    Zhang, S. F.
    [J]. 19TH INTERNATIONAL WORKSHOP ON ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH, 2020, 1525
  • [7] Detection of cloaked web spam by using tag-based methods
    Lin, Jun-Lin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 7493 - 7499
  • [8] A Web Page Clustering Method Based on Formal Concept Analysis
    Zhang, Zuping
    Zhao, Jing
    Yan, Xiping
    [J]. INFORMATION, 2018, 9 (09)
  • [9] A feature reduction technique for improved web page clustering
    Mohamed, Ehab Abdel-Hamid
    El-Beltagy, Samhaa R.
    El-Gamal, Salwa
    [J]. 2006 INNOVATIONS IN INFORMATION TECHNOLOGY, 2006, : 280 - +
  • [10] Analysis of Web page image tag distribution characteristics
    Ajiferuke, I
    Wolfram, D
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (04) : 987 - 1002