An Improved Latent Dirichlet Allocation Method for Service Topic Detection

Cited by: 0
Authors
Guo Lantian [1]
Li Zhe [1]
Yang Tao [1, 2]
Zhang Huixiang [1]
Mu Dejun [1]
Li Yang [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China
Keywords
Word Embedding; LDA Model; Service Topic; Perplexity;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Service topic detection is one of the most important techniques in service information extraction, clustering, and recommendation. Compared with short-text corpora from social networks, service description corpora have higher dimensionality and greater diversity, which makes it difficult to detect topics from a large number of service descriptions. To address these challenges, we propose a new topic detection method based on the LDA (Latent Dirichlet Allocation) model, referred to as CV-LDA (Context-sensitive word Vector based LDA). It uses a word-embedding-based method that generates context-sensitive vectors to cluster words and thereby reduce dimensionality. Topic perplexity analysis on a real-world dataset shows that the topics detected by our method have lower perplexity than those obtained with word-frequency-weighting-based vectors.
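The abstract evaluates topics by perplexity without stating the formula. As a minimal sketch, assuming the standard definition (the exponential of the negative mean per-token log-likelihood), the computation for a fitted LDA-style model could look like the following. The function names, the `theta` (document-topic) and `phi` (topic-word) structures, and the toy values are illustrative assumptions, not taken from the paper.

```python
import math


def token_log_probs(doc, theta, phi):
    """Per-token log-likelihood under a topic mixture (hypothetical helper).

    doc   : list of word tokens
    theta : list of topic weights for this document, summing to 1
    phi   : list of dicts, one per topic, mapping word -> probability
    """
    # p(w | d) = sum_k theta[k] * phi[k][w]
    return [
        math.log(sum(t * topic[w] for t, topic in zip(theta, phi)))
        for w in doc
    ]


def perplexity(log_probs):
    """Perplexity = exp(-(1/N) * sum_i log p(w_i)); lower is better."""
    if not log_probs:
        raise ValueError("need at least one token")
    return math.exp(-sum(log_probs) / len(log_probs))


# Toy example (assumed values): two topics, uniform over a two-word vocabulary.
theta = [0.5, 0.5]
phi = [{"a": 0.5, "b": 0.5}, {"a": 0.5, "b": 0.5}]
print(perplexity(token_log_probs(["a", "b"], theta, phi)))
```

With a uniform model over two words, the result is approximately 2, which matches the intuition that perplexity measures the effective number of equally likely word choices.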
Pages: 7045-7049
Page count: 5