Intra-document and Inter-document Redundancy in Multi-document Summarization

被引:1
|
作者
Carrillo-Mendoza, Pabel [1 ]
Calvo, Hiram [1 ]
Gelbukh, Alexander [1 ]
机构
[1] Inst Politecn Nacl, CIC, Ave Juan de Dios Batiz, Mexico City 07738, DF, Mexico
关键词
Multi-document summarization; Graph-based methods; Unsupervised summarization; Doc2vec; Intra-document redundancy; Per-document redundancy; Inter-document redundancy; Cross-documents redundancy;
D O I
10.1007/978-3-319-62434-1_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-document summarization differs from single-document summarization in excessive redundancy of mentions of some events or ideas. We show how the amount of redundancy in a document collection can be used for assigning importance to sentences in multi-document extractive summarization: for instance, an idea could be important if it is redundant across documents because of its popularity; on the other hand, an idea could be important if it is not redundant across documents because of its novelty. We propose an unsupervised graph-based technique that, based on proper similarity measures, allows us to experiment with intra-document and inter-document redundancy. Our experiments on DUC corpora show promising results.
引用
收藏
页码:105 / 115
页数:11
相关论文
共 50 条
  • [41] A Game Theory Approach for Multi-document Summarization
    Ahmad, Amreen
    Ahmad, Tanvir
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (04) : 3655 - 3667
  • [42] Multi-document summarization based on unsupervised clustering
    Ji, Paul
    INFORMATION RETRIEVAL TECHNOLOLGY, PROCEEDINGS, 2006, 4182 : 560 - 566
  • [43] Geodesic Distance based Multi-document Summarization
    Ma, Huifang
    He, Qing
    Shi, Zhongzhi
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 54 - 59
  • [44] A Hybrid Topic Model for Multi-Document Summarization
    Xu, JinAn
    Liu, JiangMing
    Araki, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05): : 1089 - 1094
  • [45] MRS for multi-document summarization by sentence extraction
    Yong-Dong Xu
    Xiao-Dong Zhang
    Guang-Ri Quan
    Ya-Dong Wang
    Telecommunication Systems, 2013, 53 : 91 - 98
  • [46] Multi-document Summarization for E-Learning
    Wang, Fu Lee
    Kwan, Reggie
    Hung, Sheung Lun
    HYBRID LEARNING AND EDUCATION, PROCEEDINGS, 2009, 5685 : 353 - +
  • [47] A New Approach for Multi-Document Update Summarization
    Chong Long
    Min-Lie Huang
    Xiao-Yan Zhu
    Ming Li
    Journal of Computer Science and Technology, 2010, 25 : 739 - 749
  • [48] Enhancing multi-document summarization using concepts
    Pattabhi R K Rao
    S Lalitha Devi
    Sādhanā, 2018, 43
  • [49] Personalized Multi-Document Summarization in information retrieval
    Yang, Xiao-Peng
    Liu, Xiao-Rong
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 4108 - +
  • [50] Identification of Event and Topic for Multi-document Summarization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    Takasu, Atsuhiro
    Matsuyoshi, Suguru
    HUMAN LANGUAGE TECHNOLOGY: CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2016, 9561 : 304 - 316