Context-Aware Latent Dirichlet Allocation for Topic Segmentation

Cited by: 6
Authors
Li, Wenbo [1, 2]
Matsukawa, Tetsu [1, 2]
Saigo, Hiroto [1, 2]
Suzuki, Einoshin [1, 2, 3]
Affiliations
[1] Kyushu Univ, Grad Sch, Fukuoka, Japan
[2] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka, Japan
[3] Kyushu Univ, Grad Sch Syst Life Sci, Fukuoka, Japan
Funding
Japan Society for the Promotion of Science
Keywords
DISCOVERY; MODEL;
DOI
10.1007/978-3-030-47426-3_37
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a new generative model for topic segmentation based on Latent Dirichlet Allocation. The task is to divide a document into a sequence of topically coherent segments, while preserving long topic change-points (coherency) and keeping short topic segments from getting merged (saliency). Most of the existing models either fuse topic segments by keywords or focus on modeling word co-occurrence patterns without merging. They can hardly achieve both coherency and saliency since many words have high uncertainties in topic assignments due to their polysemous nature. To solve this problem, we introduce topic-specific co-occurrence of word pairs within contexts in modeling, to generate more coherent segments and alleviate the influence of irrelevant words on topic assignment. We also design an optimization algorithm to eliminate redundant items in the generated topic segments. Experimental results show that our proposal produces significant improvements in both topic coherence and topic segmentation.
Pages: 475 - 486
Page count: 12
Related papers
50 entries in total
  • [22] Overlapped latent Dirichlet allocation for efficient image segmentation
    Jeong, Young-Seob
    Choi, Ho-Jin
    SOFT COMPUTING, 2015, 19 (04) : 829 - 838
  • [23] A Context-Aware Topic Model for Statistical Machine Translation
    Su, Jinsong
    Xiong, Deyi
    Liu, Yang
    Han, Xianpei
    Lin, Hongyu
    Yao, Junfeng
    Zhang, Min
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015 : 229 - 238
  • [24] Context-Aware Malware Detection Using Topic Modeling
    Stegner, Wayne
    Kapp, David
    Kebede, Temesguen
    Jha, Rashmi
    PROCEEDINGS OF THE 2021 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2021 : 326 - 331
  • [25] CARIS: Context-Aware Referring Image Segmentation
    Liu, Sun-Ao
    Zhang, Yiheng
    Qiu, Zhaofan
    Xie, Hongtao
    Zhang, Yongdong
    Yao, Ting
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023 : 779 - 788
  • [26] Dense Material Segmentation with Context-Aware Network
    Heng, Yuwen
    Wu, Yihong
    Dasmahapatra, Srinandan
    Kim, Hansung
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2022, 2023, 1815 : 66 - 88
  • [27] CNet: Context-Aware Network for Semantic Segmentation
    Cheng, Rongliang
    Zhang, Junge
    Yang, Peipei
    Liu, Kangwei
    Zhang, Shujun
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017 : 67 - 72
  • [28] Context-Aware Domain Adaptation in Semantic Segmentation
    Yang, Jinyu
    An, Weizhi
    Yan, Chaochao
    Zhao, Peilin
    Huang, Junzhou
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021 : 514 - 524
  • [29] Learning Context-Aware Classifier for Semantic Segmentation
    Tian, Zhuotao
    Cui, Jiequan
    Jiang, Li
    Qi, Xiaojuan
    Lai, Xin
    Chen, Yixin
    Liu, Shu
    Jia, Jiaya
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023 : 2438 - 2446
  • [30] CAP: Context-Aware Pruning for Semantic Segmentation
    He, Wei
    Wu, Meiqing
    Liang, Mingfu
    Lam, Siew-Kei
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021 : 959 - 968