HIERARCHICAL THEME AND TOPIC MODEL FOR SUMMARIZATION

Cited by: 1
Authors
Chien, Jen-Tzung [1]
Chang, Ying-Lan [1]
Affiliations
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan
Keywords
Topic model; structural learning; Bayesian nonparametrics; document summarization; Dirichlet
DOI
10.1109/MLSP.2013.6661943
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
This paper presents a hierarchical summarization model that extracts representative sentences from a set of documents. We select thematic sentences and identify topical words using a hierarchical theme and topic model (H2TM). The latent themes and topics are inferred from the document collection. A tree stick-breaking process is proposed to draw the theme proportions that represent sentences. Structural learning is performed without fixing the number of themes and topics in advance. The H2TM is fine-grained and flexible enough to represent words and sentences from heterogeneous documents, and thematic sentences are extracted effectively for document summarization. In the experiments, the proposed H2TM outperforms the other methods in terms of precision, recall, and F-measure.
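As a rough illustration of the idea behind drawing proportions without fixing their number, the sketch below implements the standard (flat) truncated stick-breaking construction. It is not the tree stick-breaking process proposed in the paper, whose details are not given in this record; the function name `stick_breaking`, the concentration parameter `alpha`, and the truncation level `num_sticks` are all assumptions made for the example.

```python
import random


def stick_breaking(alpha, num_sticks, seed=0):
    """Draw proportions by truncated stick-breaking (illustrative only).

    Assumption: this is the standard flat construction, not the paper's
    tree variant. `alpha` is a hypothetical concentration parameter and
    `num_sticks` a hypothetical truncation level.
    """
    rng = random.Random(seed)
    remaining = 1.0  # length of the stick not yet assigned
    proportions = []
    for _ in range(num_sticks - 1):
        # Break off a Beta(1, alpha) fraction of what is left.
        fraction = rng.betavariate(1.0, alpha)
        proportions.append(remaining * fraction)
        remaining *= 1.0 - fraction
    # Truncation: the last component takes whatever is left over.
    proportions.append(remaining)
    return proportions


if __name__ == "__main__":
    weights = stick_breaking(alpha=1.0, num_sticks=5)
    print(weights, sum(weights))
```

Smaller values of `alpha` tend to concentrate the mass on the first few components, while larger values spread it over more of them, which is how constructions of this kind let the effective number of components adapt to the data.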
Pages: 6
Related papers
  • [1] Hierarchical Theme and Topic Modeling
    Chien, Jen-Tzung
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (03) : 565 - 578
  • [2] Improving Abstractive Dialogue Summarization with Hierarchical Pretraining and Topic Segment
    Qi, MengNan
    Liu, Hao
    Fu, YuZhuo
    Liu, Ting
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1121 - 1130
  • [3] Content Coverage Maximization on Word Networks for Hierarchical Topic Summarization
    Wang, Chi
    Yu, Xiao
    Li, Yanen
    Zhai, Chengxiang
    Han, Jiawei
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 249 - 258
  • [4] Topic Paraphrasing Model for Abstractive Dialogue Summarization
    Yang, Zhizhuo
    Zhang, Wei
    PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 495 - 500
  • [5] Document Summarization with VHTM: Variational Hierarchical Topic-Aware Mechanism
    Fu, Xiyan
    Wang, Jun
    Zhang, Jinghan
    Wei, Jinmao
    Yang, Zhenglu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7740 - 7747
  • [6] A Topic Model for Hierarchical Documents
    Yang, Yang
    Wang, Feifei
    Jiang, Fei
    Jin, Shuyuan
    Xu, Jin
    2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 118 - 126
  • [7] Building the summarization model of micro-blog topic
    Cai, Jun
    Zhang, Shunxiang
    Zhu, Hongze
    Zhu, Guangli
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (01) : 797 - 809
  • [8] Mixture of Topic Model for Multi-document Summarization
    Liu Na
    Li Ming-xia
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    Xiao Peng
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 5168 - 5172
  • [9] A Hybrid Topic Model for Multi-Document Summarization
    Xu, JinAn
    Liu, JiangMing
    Araki, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05) : 1089 - 1094
  • [10] Hierarchical Summarization of Text Documents Using Topic Modeling and Formal Concept Analysis
    Akhtar, Nadeem
    Javed, Hira
    Ahmad, Tameem
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2018, VOL 2, 2019, 839 : 21 - 33