Intra-document and Inter-document Redundancy in Multi-document Summarization

被引:1
|
作者
Carrillo-Mendoza, Pabel [1 ]
Calvo, Hiram [1 ]
Gelbukh, Alexander [1 ]
机构
[1] Inst Politecn Nacl, CIC, Ave Juan de Dios Batiz, Mexico City 07738, DF, Mexico
关键词
Multi-document summarization; Graph-based methods; Unsupervised summarization; Doc2vec; Intra-document redundancy; Per-document redundancy; Inter-document redundancy; Cross-documents redundancy;
D O I
10.1007/978-3-319-62434-1_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-document summarization differs from single-document summarization in excessive redundancy of mentions of some events or ideas. We show how the amount of redundancy in a document collection can be used for assigning importance to sentences in multi-document extractive summarization: for instance, an idea could be important if it is redundant across documents because of its popularity; on the other hand, an idea could be important if it is not redundant across documents because of its novelty. We propose an unsupervised graph-based technique that, based on proper similarity measures, allows us to experiment with intra-document and inter-document redundancy. Our experiments on DUC corpora show promising results.
引用
收藏
页码:105 / 115
页数:11
相关论文
共 50 条
  • [21] Causal Maps for Multi-Document Summarization
    Strelnikoff, Sasha
    Jammalamadaka, Aruna
    Warmsley, Dana
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4437 - 4445
  • [22] A novel approach to multi-document summarization
    Qiu, Li-Qing
    Pang, Bin
    Lin, Sai-Qun
    Chen, Peng
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 187 - +
  • [23] Hierarchical Transformers for Multi-Document Summarization
    Liu, Yang
    Lapata, Mirella
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5070 - 5081
  • [24] The great importance of cross-document relationships for multi-document summarization
    Wan, Xiaojun
    Yang, Jianwu
    Xiao, Jianguo
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 131 - +
  • [25] Abstractive Multi-Document Summarization via Joint Learning with Single-Document Summarization
    Jin, Hanqi
    Wan, Xiaojun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2545 - 2554
  • [26] Exploiting cross-document relations for multi-document evolving summarization
    Afantenos, SD
    Doura, I
    Kapellou, E
    Karkaletsis, V
    METHODS AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3025 : 410 - 419
  • [27] Hierarchical Summarization: Scaling Up Multi-Document Summarization
    Christensen, Janara
    Soderland, Stephen
    Bansal, Gagan
    Mausam
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 902 - 912
  • [28] Minimum redundancy and maximum relevance for single and multi-document Arabic text summarization
    Oufaida, Houda
    Nouali, Omar
    Blache, Philippe
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (04) : 450 - 461
  • [29] Robust intra-document locations
    Phelps, TA
    Wilensky, R
    COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6): : 105 - 118
  • [30] MCRMR: Maximum coverage and relevancy with minimal redundancy based multi-document summarization
    Verma, Pradeepika
    Om, Hari
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 120 : 43 - 56