A Novel Contextual Topic Model for Query-focused Multi-document Summarization

被引:6
|
作者
Yang, Guangbing [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Joensuu 80101, Finland
关键词
Machine learning; hierarchical topic model; text summarization;
D O I
10.1109/ICTAI.2014.92
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of the oft-decried information overload negatively impacts comprehension of useful information. How to solve this problem has given rise to increase of interest in research on multi-document summarization. With the aim of seeking a new method to help justify the importance and similarity of sentences in multi-document summarization, this study proposes a novel approach based on well-known hierarchical Bayesian topic models. By investigating hierarchical topics and their correlations with respect to the lexical co-occurrences of words, the proposed contextual topic model can determine the relevance of sentences more effectively, and recognize latent topics and arrange them hierarchically as well. The quantitative evaluation results show that this model has outperformed hLDA and LDA in document modeling. In addition, a practical application demonstrates that a summarization system implementing this model can significantly improve the performance of summarization and make it comparable to state-of-the-art summarization systems.
引用
收藏
页码:576 / 583
页数:8
相关论文
共 50 条
  • [31] Mixture of Topic Model for Multi-document Summarization
    Liu Na
    Li Ming-xia
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    Xiao Peng
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 5168 - 5172
  • [32] A Hybrid Topic Model for Multi-Document Summarization
    Xu, JinAn
    Liu, JiangMing
    Araki, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05): : 1089 - 1094
  • [33] Coarse-to-Fine Query Focused Multi-Document Summarization
    Xu, Yumo
    Lapata, Mirella
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3632 - 3645
  • [34] Dual pattern-enhanced representations model for query-focused multi-document summarisation
    Wu, Yutong
    Li, Yuefeng
    Xu, Yue
    KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 736 - 748
  • [35] Using Proximity in Query Focused Multi-document Extractive Summarization
    Li, Sujian
    Zhang, Yu
    Wang, Wei
    Wang, Chen
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 179 - 188
  • [36] Query-focused multi-document summarization using co-training based semi-supervised learning
    Hu, Po
    Ji, Donghong
    Wang, Hai
    Teng, Chong
    PACLIC 23 - Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009, 1 : 190 - 199
  • [37] A Novel Relational Learning-to-Rank Approach for Topic-Focused Multi-Document Summarization
    Zhu, Yadong
    Lan, Yanyan
    Guo, Jiafeng
    Du, Pan
    Cheng, Xueqi
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 927 - 936
  • [38] Bayesian Query-Focused Summarization
    Daume, Hal, III
    Marcu, Daniel
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 305 - 312
  • [39] An Aspect-Driven Random Walk Model for Topic-Focused Multi-document Summarization
    Chali, Yllias
    Hasan, Sadid A.
    Imam, Kaisar
    INFORMATION RETRIEVAL TECHNOLOGY, 2011, 7097 : 386 - 397
  • [40] Tiered sentence based topic model for multi-document summarization
    Akhtar, Nadeem
    Beg, M. M. Sufyan
    Javed, Hira
    Hussain, Md Muzakkir
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (08): : 2131 - 2141