Labeled Phrase Latent Dirichlet Allocation

Cited by: 6
Authors
Tang, Yi-Kun [1]
Mao, Xian-Ling [1]
Huang, Heyan [1]
Affiliations
[1] Beijing Inst Technol, Dept Comp Sci & Technol, Beijing 100081, Peoples R China
Keywords
Labeled Phrase LDA; Topic model; Multi-labeled corpus
DOI
10.1007/978-3-319-48740-3_39
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, topic models such as Latent Dirichlet Allocation (LDA) and its variants have been widely used to discover abstract topics in text corpora. Two state-of-the-art topic models are Labeled LDA (LLDA) and PhraseLDA. LLDA is a supervised generative model that uses label information but, under the bag-of-words assumption, ignores word order. In contrast, PhraseLDA treats each document as a mixture of phrases, which partly accounts for word order, but it cannot model supervised label information. In this paper, to overcome the shortcomings of both models while combining their merits, we propose a novel topic model, Labeled Phrase LDA, which simultaneously considers supervised information and word order. Extensive experiments comparing the proposed model with the two state-of-the-art models show that it significantly outperforms the baselines in terms of case study, perplexity, and scalability.
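The two ideas the abstract combines can be illustrated with a toy generative sketch. It is not the authors' model or inference algorithm. It assumes the usual descriptions of the two baselines: Labeled LDA restricts a document's topics to its own labels, and PhraseLDA gives every word of a phrase the same topic. The function name `generate_document`, the `topic_word` table, and the example labels and vocabulary are all hypothetical.

```python
import random

def generate_document(labels, phrase_lengths, topic_word, alpha=1.0, seed=0):
    """Toy sketch (an assumption, not the paper's algorithm): topics are
    restricted to the document's labels, and each phrase gets one topic."""
    rng = random.Random(seed)
    # Label constraint: draw Dirichlet(alpha) proportions over this
    # document's labels only, via normalized gamma draws.
    weights = [rng.gammavariate(alpha, 1.0) for _ in labels]
    total = sum(weights)
    theta = [w / total for w in weights]
    document = []
    for length in phrase_lengths:
        # Phrase constraint: sample a single topic for the whole phrase.
        topic = rng.choices(labels, weights=theta, k=1)[0]
        vocab, probs = zip(*topic_word[topic].items())
        # Every word in the phrase is drawn from that one topic.
        words = rng.choices(vocab, weights=probs, k=length)
        document.append((topic, list(words)))
    return document

# Hypothetical label-specific word distributions.
topic_word = {
    "sports": {"match": 0.5, "team": 0.3, "goal": 0.2},
    "politics": {"vote": 0.6, "party": 0.4},
}
doc = generate_document(["sports", "politics"], [2, 1, 3], topic_word)
for topic, words in doc:
    print(topic, words)
```

Each returned pair holds one label and the words of one phrase, so both constraints can be read directly from the output. Fitting such a model to real data would need an inference procedure, which this sketch does not attempt.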
Pages: 525-536
Page count: 12
    [J]. 2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C), 2016, : 713 - 715