Topic Detection from Microblog Based on Text Clustering and Topic Model Analysis

被引:4
|
作者
Huang, Siqi [1 ]
Yang, Yitao [1 ]
Li, Huakang [1 ]
Sun, Guozi [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, 66 Xin Mofan Rd, Nanjing 210003, Jiangsu, Peoples R China
关键词
Microblog; topic detection; text clustering; LDA; LATENT SEMANTIC ANALYSIS;
D O I
10.1109/APSCC.2014.18
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper raises a Microblog topic detection method based on text clustering and topic model analysis. It solves the problem that the traditional topic detection method is mainly applicable for traditional media text, which is not very effective in handling sparse Microblog short texts. In consequence of the structural data of the Microblog, which exists rich inter-textual contextual information such as retweets, comments, user hashtag, embedded link URL, we first put forward a feature weight pre-processing method. We also use a clustering algorithm based on word vectors to enrich the feature information of the data. On this basis, we extend the conventional LDA (Latent Dirichlet allocation) topic model to extract the hot topics in the Microblog data. Compared with the traditional methods, the method raised in this paper is much more effective in the collected text corpus in Sina Microblog.
引用
收藏
页码:88 / 92
页数:5
相关论文
共 50 条
  • [1] SENTIMENT ANALYSIS OF MICROBLOG TEXT BASED ON JOINT SENTIMENT-TOPIC MODEL
    Zhang, Hui
    Liu, Yiqun
    Ma, Shaoping
    [J]. 2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 46 - 54
  • [2] MICROBLOG HOT TOPIC DETECTION BASED ON TOPIC MODEL USING TERM CORRELATION MATRIX
    Ma, Hui-Fang
    Sun, Yue-Xin
    Jia, Mei-Hui-Zi
    Zhang, Zhi-Chang
    [J]. PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2014, : 126 - 130
  • [3] A Novel Hybrid Clustering Algorithm for Microblog Topic Detection
    Geng, Xiao
    Zhang, Yanmei
    Jiao, Yuhang
    Mei, Yinan
    [J]. 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, RESOURCE AND ENVIRONMENTAL ENGINEERING (MSREE 2017), 2017, 1890
  • [4] Clustering Based Topic Events Detection on Text Stream
    Li, Chunshan
    Ye, Yunming
    Zhang, Xiaofeng
    Chu, Dianhui
    Deng, Shengchun
    Xu, Xiaofei
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT 1, 2014, 8397 : 42 - 52
  • [5] Microblog bursty topic detection method based on momentum model
    He, Min
    Du, Pan
    Zhang, Jin
    Liu, Yue
    Cheng, Xueqi
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (05): : 1022 - 1028
  • [6] A popular topic detection method based on microblog images and short text information
    Liu, Wenjun
    Wang, Hai
    Wang, Jieyang
    Guo, Huan
    Sun, Yuyan
    Hou, Mengshu
    Yu, Bao
    Wang, Hailan
    Peng, Qingcheng
    Zhang, Chao
    Liu, Cheng
    [J]. JOURNAL OF WEB SEMANTICS, 2024, 81
  • [7] A Topic Detection Method Based on Microblog Weight
    Guo, Kaijie
    Shi, Liang
    [J]. 2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 209 - 212
  • [8] A Topic-Rank Recommendation Model Based on Microblog Topic Relevance & User Preference Analysis
    Bao, Fuguan
    Xu, Wenqian
    Feng, Yao
    Xu, Chonghuan
    [J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12
  • [9] A novel text clustering model based on topic modelling and social network analysis
    Amiri, Babak
    Karimianghadim, Ramin
    [J]. CHAOS SOLITONS & FRACTALS, 2024, 181
  • [10] Microblog Sentiment Topic Model
    Ahuja, Aman
    Wei, Wei
    Carley, Kathleen M.
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 1031 - 1038