An Improved Latent Dirichlet Allocation Method for Service Topic Detection

被引:0
|
作者
Guo Lantian [1 ]
Li Zhe [1 ]
Yang Tao [1 ,2 ]
Zhang Huixiang [1 ]
Mu Dejun [1 ]
Li Yang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China
关键词
Word Embedding; LDA Model; Service Topic; Perplexity;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Service topic detection is one of the most important techniques in service information extraction, clustering and recommendation. Comparing with short text corpus in social network, service description corpus possesses higher dimensionality and more diversity. It is difficult to detect topics from a large number of service descriptions. To address these challenges, we proposed a new LDA (Latent Dirichlet Allocation) model based topic detection method, referred to as CV- LDA (Context sensitive word Vector based LDA). It utilizes a word embedding based method that generate context sensitive vector to cluster the words for decreasing dimensionality. Through topic perplexity analysis in the real- world dataset, it is obvious that topics detected by our method has a lower perplexity, comparing with word frequency weighing based vectors.
引用
收藏
页码:7045 / 7049
页数:5
相关论文
共 50 条
  • [21] Road Traffic Topic Modeling on Twitter using Latent Dirichlet Allocation
    Hidayatullah, Ahmad Fathan
    Ma'arif, Muhammad Rifqi
    [J]. 2017 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET), 2017, : 47 - 52
  • [22] Topic Modeling of Online Accommodation Reviews via Latent Dirichlet Allocation
    Sutherland, Ian
    Sim, Youngseok
    Lee, Seul Ki
    Byun, Jaemun
    Kiatkawsin, Kiattipoom
    [J]. SUSTAINABILITY, 2020, 12 (05) : 1 - 15
  • [23] iLDA: An interactive latent Dirichlet allocation model to improve topic quality
    Liu, Yezheng
    Du, Fei
    Sun, Jianshan
    Jiang, Yuanchun
    [J]. JOURNAL OF INFORMATION SCIENCE, 2020, 46 (01) : 23 - 40
  • [24] ldagibbs: A command for topic modeling in Stata using latent Dirichlet allocation
    Schwarz, Carlo
    [J]. STATA JOURNAL, 2018, 18 (01): : 101 - 117
  • [25] A FRAMEWORK OF URDU TOPIC MODELING USING LATENT DIRICHLET ALLOCATION (LDA)
    Shakeel, Khadija
    Tahir, Ghulam Rasool
    Tehseen, Irsha
    Ali, Mubashir
    [J]. 2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2018, : 117 - 123
  • [26] Latent Dirichlet allocation (LDA) for topic modeling of the CFPB consumer complaints
    Bastani, Kaveh
    Namavari, Hamed
    Shaffer, Jeffrey
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 127 : 256 - 271
  • [27] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Jelodar, Hamed
    Wang, Yongli
    Yuan, Chi
    Feng, Xia
    Jiang, Xiahui
    Li, Yanchao
    Zhao, Liang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (11) : 15169 - 15211
  • [28] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Hamed Jelodar
    Yongli Wang
    Chi Yuan
    Xia Feng
    Xiahui Jiang
    Yanchao Li
    Liang Zhao
    [J]. Multimedia Tools and Applications, 2019, 78 : 15169 - 15211
  • [29] A Latent Dirichlet Allocation method for Selectional Preferences
    Ritter, Alan
    Mausam
    Etzioni, Oren
    [J]. ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 424 - 434
  • [30] Technological topic analysis of standard-essential patents based on the improved Latent Dirichlet Allocation (LDA) model
    Tian, Chen
    Zhang, Junyan
    Liu, Dayong
    Wang, Qing
    Lin, Shen
    [J]. TECHNOLOGY ANALYSIS & STRATEGIC MANAGEMENT, 2024, 36 (09) : 2084 - 2099