Intra-document and Inter-document Redundancy in Multi-document Summarization

Cited by: 1
Authors
Carrillo-Mendoza, Pabel [1]
Calvo, Hiram [1]
Gelbukh, Alexander [1]
Affiliations
[1] Inst Politecn Nacl, CIC, Ave Juan de Dios Batiz, Mexico City 07738, DF, Mexico
Keywords
Multi-document summarization; Graph-based methods; Unsupervised summarization; Doc2vec; Intra-document redundancy; Per-document redundancy; Inter-document redundancy; Cross-documents redundancy;
DOI
10.1007/978-3-319-62434-1_9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multi-document summarization differs from single-document summarization in excessive redundancy of mentions of some events or ideas. We show how the amount of redundancy in a document collection can be used for assigning importance to sentences in multi-document extractive summarization: for instance, an idea could be important if it is redundant across documents because of its popularity; on the other hand, an idea could be important if it is not redundant across documents because of its novelty. We propose an unsupervised graph-based technique that, based on proper similarity measures, allows us to experiment with intra-document and inter-document redundancy. Our experiments on DUC corpora show promising results.
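The idea of treating intra-document and inter-document redundancy separately in a sentence graph can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their similarity measure or ranking formula, so the code below assumes a bag-of-words cosine similarity (standing in for the Doc2vec-style measure named in the keywords), a PageRank-style ranking, and two hypothetical parameters, `intra_weight` and `inter_weight`, that scale edges between sentences of the same document and of different documents.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_sentences(docs, intra_weight=1.0, inter_weight=1.0,
                   damping=0.85, iters=50):
    """Rank sentences with a PageRank-style walk over a sentence graph.

    docs: list of documents, each a list of sentence strings.
    intra_weight / inter_weight: hypothetical knobs (not from the paper)
    scaling edges within a document vs. across documents.
    Returns (score, doc_index, sentence) tuples, highest score first.
    """
    nodes = [(d, s) for d, doc in enumerate(docs) for s in doc]
    n = len(nodes)
    if n == 0:
        return []
    bags = [Counter(s.lower().split()) for _, s in nodes]

    # Weighted adjacency matrix without self-loops.
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                same_doc = nodes[i][0] == nodes[j][0]
                scale = intra_weight if same_doc else inter_weight
                w[i][j] = scale * cosine(bags[i], bags[j])

    out = [sum(row) for row in w]
    scores = [1.0 / n] * n
    for _ in range(iters):
        new_scores = []
        for j in range(n):
            # Sentences with no edges contribute nothing here and
            # themselves receive only the teleport term below.
            incoming = sum(scores[i] * w[i][j] / out[i]
                           for i in range(n) if out[i] > 0)
            new_scores.append((1 - damping) / n + damping * incoming)
        scores = new_scores

    return sorted(((scores[k], nodes[k][0], nodes[k][1]) for k in range(n)),
                  reverse=True)
```

In this sketch, setting `intra_weight=0.0` ranks sentences purely by how strongly they are echoed in other documents (the "popularity" reading of redundancy). Rewarding novelty instead would need a different scoring rule, which the record does not specify.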
Pages: 105-115
Page count: 11
Related papers
50 in total
  • [1] On redundancy in multi-document summarization
    Calvo, Hiram
    Carrillo-Mendoza, Pabel
    Gelbukh, Alexander
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3245 - 3255
  • [2] Promoting Topic Coherence and Inter-Document Consorts in Multi-Document Summarization via Simplicial Complex and Sheaf Graph
    Atri, Yash Kumar
    Iyer, Arun
    Chakraborty, Tanmoy
    Goyal, Vikram
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 2154 - 2166
  • [3] A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization
    Parnell, Jacob
    Unanue, Inigo Jauregi
    Piccardi, Massimo
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5112 - 5128
  • [4] MULTI-DOCUMENT VIDEO SUMMARIZATION
    Wang, Feng
    Merialdo, Bernard
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1326 - 1329
  • [5] Abstractive Multi-Document Summarization
    Ranjitha, N. S.
    Kallimani, Jagadish S.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 1690 - 1693
  • [6] Document-Based HITS Model for Multi-document Summarization
    Wan, Xiaojun
    PRICAI 2008: TRENDS IN ARTIFICIAL INTELLIGENCE, 2008, 5351 : 454 - 465
  • [7] A document-sensitive graph model for multi-document summarization
    Furu Wei
    Wenjie Li
    Qin Lu
    Yanxiang He
    Knowledge and Information Systems, 2010, 22 : 245 - 259
  • [8] A document-sensitive graph model for multi-document summarization
    Wei, Furu
    Li, Wenjie
    Lu, Qin
    He, Yanxiang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 22 (02) : 245 - 259
  • [9] Inter and intra-document contexts applied in polyrepresentation
    Skov, Mette
    Larsen, Birger
    Ingwersen, Peter
    Information Interaction in Context, Proceedings, 2006, : 163 - 170
  • [10] Weighted consensus multi-document summarization
    Wang, Dingding
    Li, Tao
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (03) : 513 - 523