An Entailment-based Scoring Method for Content Selection in Document Summarization

被引：2

作者：

Dang Hoang Long ^{[1
]}

Minh-Tien Nguyen ^{[2
]}

Ngo Xuan Bach ^{[1
]}

Le-Minh Nguyen ^{[3
]}

Tu Minh Phuong ^{[1
]}

机构：

[1] Posts & Telecommun Inst Technol, Hanoi, Vietnam

[2] Hung Yen Univ Technol & Educ, Hung Yen, Vietnam

[3] Japan Adv Inst Sci & Technol, 1-8 Asahidai, Nomi, Ishikawa, Japan

来源：

PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2018) | 2018年

关键词：

Web Document Summarization; Entailment; Sentence Scoring; Integer Linear Programming (ILP);

D O I：

10.1145/3287921.3287976

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper introduces a scoring method to improve the quality of content selection in an extractive summarization system. Different from previous models mainly using local information inside sentences such as sentence position or sentence length, our method judges the importance of a sentence based on its own information and the relation between sentences. For the relation between sentences, we utilize textual entailment, a relationship indicating that the meaning of a sentence can be inferred from another one. Unlike previous work on using textual entailment for summarization, we go a step further by looking at aligned words in an entailment sentence pair. Assuming that important words in a salient sentence can be aligned by several words in other sentences, word alignment scores are exploited to compute the entailment score of a sentence. To take advantage of local and neighbor information for facilitating the salient estimation of sentences, we combine entailment scores with sentence position scores. We validate the proposed scoring method with greedy or integer linear programming approaches for extracting summaries. Experiments on three datasets (including DUC 2001 and 2002) in two different domains show that our model obtains competitive ROUGE-scores with state-of-the-art methods for single-document summarization.

引用

页码：122 / 129

页数：8

共 50 条

[1] ENTAILMENT-BASED LINEAR SEGMENTATION IN SUMMARIZATION
Tatar, Doina
Mihis, Andreea
Lupsa, Dana
Tamaianu-Morita, Emma
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2009, 19 (08) : 1023 - 1038
[2] Textual Entailment-Based Figure Summarization for Biomedical Articles
Saini, Naveen
Saha, Sriparna
Bhattacharyya, Pushpak
Tuteja, Himanshu
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
[3] Enhance Content Selection for Multi-Document Summarization with Entailment Relation
Wang, Yu-Yun
Wu, Jhen-Yi
Chou, Tzu-Hsuan
Lin, Ying-Jia
Kao, Hung-Yu
2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 119 - 124
[4] OPINESUM: Entailment-based self-training for abstractive opinion summarization
Louis, Annie
Maynez, Joshua
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10774 - 10790
[5] Entailment-based actions for coordination
Monteiro, L
Porto, A
THEORETICAL COMPUTER SCIENCE, 1998, 192 (02) : 259 - 286
[6] PE-MSC: partial entailment-based minimum set cover for text summarization
Gupta, Anand
Kaur, Manpreet
Mittal, Sonaali
Garg, Swati
KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (05) : 1045 - 1068
[7] EntailSum: An Entailment-Based Approach to Aspect-Based Text Summarization with Automated Aspect Adaptation
Ankner, Zachary
Balaji, Purvaja
Zhu, Ye
Hiew, Chun Keat
Wang, Patrick
Gupta, Amar
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (13)
[8] PE-MSC: partial entailment-based minimum set cover for text summarization
Anand Gupta
Manpreet Kaur
Sonaali Mittal
Swati Garg
Knowledge and Information Systems, 2021, 63 : 1045 - 1068
[9] Document Summarization using a Scoring-Based Representation
Villa Monte, Augusto
Lanzarini, Laura
Rojas Flores, Luis
Olivas Varela, Jose A.
PROCEEDINGS OF THE 2016 XLII LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2016,
[10] A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization
Zhou, Qingyu
Yang, Nan
Wei, Furu
Huang, Shaohan
Zhou, Ming
Zhao, Tiejun
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 671 - 681

← 1 2 3 4 5 →