HIERARCHICAL THEME AND TOPIC MODEL FOR SUMMARIZATION

被引:1
|
作者
Chien, Jen-Tzung [1 ]
Chang, Ying-Lan [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan
关键词
Topic model; structural learning; Bayesian nonparametrics; document summarization; DIRICHLET;
D O I
10.1109/MLSP.2013.6661943
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a hierarchical summarization model to extract representative sentences from a set of documents. In this study, we select the thematic sentences and identify the topical words based on a hierarchical theme and topic model (H2TM). The latent themes and topics are inferred from document collection. A tree stick-breaking process is proposed to draw the theme proportions for representation of sentences. The structural learning is performed without fixing the number of themes and topics. This H2TM is delicate and flexible to represent words and sentences from heterogeneous documents. Thematic sentences are effectively extracted for document summarization. In the experiments, the proposed H2TM outperforms the other methods in terms of precision, recall and F-measure.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Neural Attention-Aware Hierarchical Topic Model
    Jin, Yuan
    Zhao, He
    Liu, Ming
    Du, Lan
    Buntine, Wray
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1042 - 1052
  • [42] Scale-Invariant Infinite Hierarchical Topic Model
    Eshima, Shusei
    Mochihashi, Daichi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11731 - 11746
  • [43] Causality Model for Text Data with a Hierarchical Topic Structure
    Ogawa, Takuro
    Shimadzu, Hideyasu
    Saga, Ryosuke
    2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 205 - 210
  • [44] Topic level summary generation using BERT induced Abstractive Summarization Model
    Ramina, Mayank
    Darnay, Nihar
    Ludbe, Chirag
    Dhruv, Ajay
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 747 - 752
  • [45] Sentiment Lexicon Construction With Hierarchical Supervision Topic Model
    Deng, Dong
    Jing, Liping
    Yu, Jian
    Sun, Shaolong
    Ng, Michael K.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (04) : 704 - 718
  • [46] A hierarchical latent topic model based on sparse coding
    Zhu, Wenjun
    Zhang, Liqing
    Bian, Qianwei
    NEUROCOMPUTING, 2012, 76 (01) : 28 - 35
  • [47] Bayesian Text Classification and Summarization via A Class-Specified Topic Model
    Wang, Feifei
    Zhang, Junni L.
    Li, Yichao
    Deng, Ke
    Liu, Jun S.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [48] A novel abstractive summarization model based on topic-aware and contrastive learning
    Tang, Huanling
    Li, Ruiquan
    Duan, Wenhao
    Dou, Quansheng
    Lu, Mingyu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 5563 - 5577
  • [49] Bayesian text classification and summarization via a class-specified topic model
    Wang, Feifei
    Zhang, Junni L.
    Li, Yichao
    Deng, Ke
    Liu, Jun S.
    1600, Microtome Publishing (22):
  • [50] Concept-based Topic Attention for a Convolutional Sequence Document Summarization Model
    Khanam, Shirin Akther
    Liu, Fei
    Chen, Yi-Ping Phoebe
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,