An Improved Latent Dirichlet Allocation Method for Service Topic Detection

被引:0
|
作者
Guo Lantian [1 ]
Li Zhe [1 ]
Yang Tao [1 ,2 ]
Zhang Huixiang [1 ]
Mu Dejun [1 ]
Li Yang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China
关键词
Word Embedding; LDA Model; Service Topic; Perplexity;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Service topic detection is one of the most important techniques in service information extraction, clustering and recommendation. Comparing with short text corpus in social network, service description corpus possesses higher dimensionality and more diversity. It is difficult to detect topics from a large number of service descriptions. To address these challenges, we proposed a new LDA (Latent Dirichlet Allocation) model based topic detection method, referred to as CV- LDA (Context sensitive word Vector based LDA). It utilizes a word embedding based method that generate context sensitive vector to cluster the words for decreasing dimensionality. Through topic perplexity analysis in the real- world dataset, it is obvious that topics detected by our method has a lower perplexity, comparing with word frequency weighing based vectors.
引用
收藏
页码:7045 / 7049
页数:5
相关论文
共 50 条
  • [1] An Improved Latent Dirichlet Allocation Model for Hot Topic Extraction
    Liu, Guolong
    Xu, Xiaofei
    Zhu, Ying
    Li, Li
    [J]. 2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 470 - 476
  • [2] Topic Selection in Latent Dirichlet Allocation
    Wang, Biao
    Liu, Zelong
    Li, Maozhen
    Liu, Yang
    Qi, Man
    [J]. 2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 756 - 760
  • [3] Topic Modelling Twitter Data with Latent Dirichlet Allocation Method
    Negara, Edi Surya
    Triadi, Dendi
    Andryani, Ria
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICECOS 2019), 2019, : 386 - 390
  • [4] Topic and Trend Detection in Text Collections Using Latent Dirichlet Allocation
    Bolelli, Levent
    Ertekin, Seyda
    Giles, C. Lee
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 776 - +
  • [5] Using Latent Dirichlet Allocation to Incorporate Domain Knowledge For Topic Transition Detection
    Zhu, Xiaodan
    He, Xuming
    Munteanu, Cosmin
    Penn, Gerald
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2442 - 2445
  • [6] Topic Modeling Using Latent Dirichlet allocation: A Survey
    Chauhan, Uttam
    Shah, Apurva
    [J]. ACM COMPUTING SURVEYS, 2021, 54 (07)
  • [7] A Hybrid Latent Dirichlet Allocation Approach for Topic Classification
    Hsu, Chi-I
    Chiu, Chaochang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 312 - 315
  • [8] Semantic latent dirichlet allocation for automatic topic extraction
    Bhutada, Sunil
    Balaram, V. V. S. S. S.
    Bulusu, Vishnu Vardhan
    [J]. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2016, 37 (03): : 449 - 469
  • [9] Using Latent Dirichlet Allocation for Topic Modelling in Twitter
    Ostrowski, David Alfred
    [J]. 2015 IEEE 9TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2015, : 493 - 497
  • [10] Topic Model Allocation of Conversational Dialogue Records by Latent Dirichlet Allocation
    Yeh, Jui-Feng
    Lee, Chen-Hsien
    Tan, Yi-Shiuan
    Yu, Liang-Chih
    [J]. 2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,