An Entailment-based Scoring Method for Content Selection in Document Summarization

被引:2
|
作者
Dang Hoang Long [1 ]
Minh-Tien Nguyen [2 ]
Ngo Xuan Bach [1 ]
Le-Minh Nguyen [3 ]
Tu Minh Phuong [1 ]
机构
[1] Posts & Telecommun Inst Technol, Hanoi, Vietnam
[2] Hung Yen Univ Technol & Educ, Hung Yen, Vietnam
[3] Japan Adv Inst Sci & Technol, 1-8 Asahidai, Nomi, Ishikawa, Japan
关键词
Web Document Summarization; Entailment; Sentence Scoring; Integer Linear Programming (ILP);
D O I
10.1145/3287921.3287976
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper introduces a scoring method to improve the quality of content selection in an extractive summarization system. Different from previous models mainly using local information inside sentences such as sentence position or sentence length, our method judges the importance of a sentence based on its own information and the relation between sentences. For the relation between sentences, we utilize textual entailment, a relationship indicating that the meaning of a sentence can be inferred from another one. Unlike previous work on using textual entailment for summarization, we go a step further by looking at aligned words in an entailment sentence pair. Assuming that important words in a salient sentence can be aligned by several words in other sentences, word alignment scores are exploited to compute the entailment score of a sentence. To take advantage of local and neighbor information for facilitating the salient estimation of sentences, we combine entailment scores with sentence position scores. We validate the proposed scoring method with greedy or integer linear programming approaches for extracting summaries. Experiments on three datasets (including DUC 2001 and 2002) in two different domains show that our model obtains competitive ROUGE-scores with state-of-the-art methods for single-document summarization.
引用
收藏
页码:122 / 129
页数:8
相关论文
共 50 条
  • [1] ENTAILMENT-BASED LINEAR SEGMENTATION IN SUMMARIZATION
    Tatar, Doina
    Mihis, Andreea
    Lupsa, Dana
    Tamaianu-Morita, Emma
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2009, 19 (08) : 1023 - 1038
  • [2] Textual Entailment-Based Figure Summarization for Biomedical Articles
    Saini, Naveen
    Saha, Sriparna
    Bhattacharyya, Pushpak
    Tuteja, Himanshu
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
  • [3] Enhance Content Selection for Multi-Document Summarization with Entailment Relation
    Wang, Yu-Yun
    Wu, Jhen-Yi
    Chou, Tzu-Hsuan
    Lin, Ying-Jia
    Kao, Hung-Yu
    2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 119 - 124
  • [4] OPINESUM: Entailment-based self-training for abstractive opinion summarization
    Louis, Annie
    Maynez, Joshua
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10774 - 10790
  • [5] Entailment-based actions for coordination
    Monteiro, L
    Porto, A
    THEORETICAL COMPUTER SCIENCE, 1998, 192 (02) : 259 - 286
  • [6] PE-MSC: partial entailment-based minimum set cover for text summarization
    Gupta, Anand
    Kaur, Manpreet
    Mittal, Sonaali
    Garg, Swati
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (05) : 1045 - 1068
  • [7] EntailSum: An Entailment-Based Approach to Aspect-Based Text Summarization with Automated Aspect Adaptation
    Ankner, Zachary
    Balaji, Purvaja
    Zhu, Ye
    Hiew, Chun Keat
    Wang, Patrick
    Gupta, Amar
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (13)
  • [8] PE-MSC: partial entailment-based minimum set cover for text summarization
    Anand Gupta
    Manpreet Kaur
    Sonaali Mittal
    Swati Garg
    Knowledge and Information Systems, 2021, 63 : 1045 - 1068
  • [9] Document Summarization using a Scoring-Based Representation
    Villa Monte, Augusto
    Lanzarini, Laura
    Rojas Flores, Luis
    Olivas Varela, Jose A.
    PROCEEDINGS OF THE 2016 XLII LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2016,
  • [10] A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization
    Zhou, Qingyu
    Yang, Nan
    Wei, Furu
    Huang, Shaohan
    Zhou, Ming
    Zhao, Tiejun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 671 - 681