An Entailment-based Scoring Method for Content Selection in Document Summarization

被引:2
|
作者
Dang Hoang Long [1 ]
Minh-Tien Nguyen [2 ]
Ngo Xuan Bach [1 ]
Le-Minh Nguyen [3 ]
Tu Minh Phuong [1 ]
机构
[1] Posts & Telecommun Inst Technol, Hanoi, Vietnam
[2] Hung Yen Univ Technol & Educ, Hung Yen, Vietnam
[3] Japan Adv Inst Sci & Technol, 1-8 Asahidai, Nomi, Ishikawa, Japan
关键词
Web Document Summarization; Entailment; Sentence Scoring; Integer Linear Programming (ILP);
D O I
10.1145/3287921.3287976
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper introduces a scoring method to improve the quality of content selection in an extractive summarization system. Different from previous models mainly using local information inside sentences such as sentence position or sentence length, our method judges the importance of a sentence based on its own information and the relation between sentences. For the relation between sentences, we utilize textual entailment, a relationship indicating that the meaning of a sentence can be inferred from another one. Unlike previous work on using textual entailment for summarization, we go a step further by looking at aligned words in an entailment sentence pair. Assuming that important words in a salient sentence can be aligned by several words in other sentences, word alignment scores are exploited to compute the entailment score of a sentence. To take advantage of local and neighbor information for facilitating the salient estimation of sentences, we combine entailment scores with sentence position scores. We validate the proposed scoring method with greedy or integer linear programming approaches for extracting summaries. Experiments on three datasets (including DUC 2001 and 2002) in two different domains show that our model obtains competitive ROUGE-scores with state-of-the-art methods for single-document summarization.
引用
收藏
页码:122 / 129
页数:8
相关论文
共 50 条
  • [31] CONENTAIL: An Entailment-based Framework for Universal Zero and Few Shot Classification with Supervised Contrastive Pretraining
    Zhang, Ranran Haoran
    Fan, Aysa Xuemo
    Zhang, Rui
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1941 - 1953
  • [32] Fuzzy logic based multi document summarization with improved sentence scoring and redundancy removal technique
    Patel, Darshna
    Shah, Saurabh
    Chhinkaniwala, Hitesh
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 134 : 167 - 177
  • [33] A Combination of Visual-Semantic Reasoning and Text Entailment-based Boosting Algorithm for Cheapfake Detection
    Tuan-Vinh La
    Dao, Minh-Son
    Quang-Tien Tran
    Thanh-Phuc Tran
    Anh-Duy Tran
    Duc Tien Dang Nguyen
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 7140 - 7144
  • [34] Automated Bengali Document Summarization By Collaborating Individual Word & Sentence Scoring
    Chandro, Porimol
    Arif, Md Faizul Huq
    Rahman, Md Mahbubur
    Siddik, Md Saeed
    Rahman, Mohammad Sayeedur
    Rahman, Md Abdur
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [35] Subtopic-focused sentence scoring in multi-document summarization
    Li Sujian
    Qu Weiguang
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 98 - +
  • [36] Summarization as Feature Selection for Document Categorization on Small Datasets
    Anguiano-Hernandez, Emmanuel
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    Rosso, Paolo
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 39 - +
  • [37] Comparative Document Summarization via Discriminative Sentence Selection
    Wang, Dingding
    Zhu, Shenghuo
    Li, Tao
    Gong, Yihong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2012, 6 (03)
  • [38] Multi Document Summarization Evaluation in the Presence of Damaging Content
    Manevich, Avshalom
    Carmel, David
    Cohen, Nachshon
    Kravi, Elad
    Shapira, Ori
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 1 - 12
  • [39] Comparative Document Summarization via Discriminative Sentence Selection
    Wang, Dingding
    Zhu, Shenghuo
    Li, Tao
    Gong, Yihong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2013, 7 (01)
  • [40] A novel partitioning-based clustering method and generic document summarization
    Aliguliyev, Ramiz M.
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS PROCEEDINGS, 2006, : 626 - 629