Context-Aware Latent Dirichlet Allocation for Topic Segmentation

Cited by: 6
Authors
Li, Wenbo [1, 2]
Matsukawa, Tetsu [1, 2]
Saigo, Hiroto [1, 2]
Suzuki, Einoshin [1, 2, 3]
Affiliations
[1] Kyushu Univ, Grad Sch, Fukuoka, Japan
[2] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka, Japan
[3] Kyushu Univ, Grad Sch Syst Life Sci, Fukuoka, Japan
Funding
Japan Society for the Promotion of Science
Keywords
DISCOVERY; MODEL
DOI
10.1007/978-3-030-47426-3_37
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a new generative model for topic segmentation based on Latent Dirichlet Allocation. The task is to divide a document into a sequence of topically coherent segments, while preserving long topic change-points (coherency) and keeping short topic segments from getting merged (saliency). Most of the existing models either fuse topic segments by keywords or focus on modeling word co-occurrence patterns without merging. They can hardly achieve both coherency and saliency since many words have high uncertainties in topic assignments due to their polysemous nature. To solve this problem, we introduce topic-specific co-occurrence of word pairs within contexts in modeling, to generate more coherent segments and alleviate the influence of irrelevant words on topic assignment. We also design an optimization algorithm to eliminate redundant items in the generated topic segments. Experimental results show that our proposal produces significant improvements in both topic coherence and topic segmentation.
Pages: 475-486
Page count: 12