An Entailment-based Scoring Method for Content Selection in Document Summarization

被引:2
|
作者
Dang Hoang Long [1 ]
Minh-Tien Nguyen [2 ]
Ngo Xuan Bach [1 ]
Le-Minh Nguyen [3 ]
Tu Minh Phuong [1 ]
机构
[1] Posts & Telecommun Inst Technol, Hanoi, Vietnam
[2] Hung Yen Univ Technol & Educ, Hung Yen, Vietnam
[3] Japan Adv Inst Sci & Technol, 1-8 Asahidai, Nomi, Ishikawa, Japan
关键词
Web Document Summarization; Entailment; Sentence Scoring; Integer Linear Programming (ILP);
D O I
10.1145/3287921.3287976
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper introduces a scoring method to improve the quality of content selection in an extractive summarization system. Different from previous models mainly using local information inside sentences such as sentence position or sentence length, our method judges the importance of a sentence based on its own information and the relation between sentences. For the relation between sentences, we utilize textual entailment, a relationship indicating that the meaning of a sentence can be inferred from another one. Unlike previous work on using textual entailment for summarization, we go a step further by looking at aligned words in an entailment sentence pair. Assuming that important words in a salient sentence can be aligned by several words in other sentences, word alignment scores are exploited to compute the entailment score of a sentence. To take advantage of local and neighbor information for facilitating the salient estimation of sentences, we combine entailment scores with sentence position scores. We validate the proposed scoring method with greedy or integer linear programming approaches for extracting summaries. Experiments on three datasets (including DUC 2001 and 2002) in two different domains show that our model obtains competitive ROUGE-scores with state-of-the-art methods for single-document summarization.
引用
收藏
页码:122 / 129
页数:8
相关论文
共 50 条
  • [41] A sentence scoring method for extractive text summarization based on natural language queries
    I.T Department, G.V.P College of Engineering, Visakhapatnam, Andhra Pradesh 530048, India
    不详
    Int. J. Comput. Sci. Issues, 3 (259-262):
  • [42] Document Summarization Based on Semantic Representations
    Zhang, Hui
    Zhang, Xueliang
    Gao, Guanglai
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 152 - 155
  • [43] Document Summarization Based on Word Associations
    Gross, Oskar
    Doucet, Antoine
    Toivonen, Hannu
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1023 - 1026
  • [44] WordNet-based document summarization
    Dang, Chenghua
    Luo, Xinjun
    WSEAS: ADVANCES ON APPLIED COMPUTER AND APPLIED COMPUTATIONAL SCIENCE, 2008, : 383 - +
  • [45] Investigations in Single Document Summarization by Extraction Method
    Hariharan, Shanmugasundaram
    Srinivasan, Rengaramanujam
    ICCN: 2008 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING, 2008, : 106 - +
  • [46] A behavior mode for content unit selection in summarization
    Teng, Chong
    He, Yanxiang
    Liu, Dexi
    Ji, Donghong
    Yang, Hua
    Xiong, Naixue
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 250 - 254
  • [47] Content Selection in Deep Learning Models of Summarization
    Kedzie, Chris
    McKeown, Kathleen
    Daume, Hal, III
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1818 - +
  • [48] A fusion of variants of sentence scoring methods and collaborative word rankings for document summarization
    Verma, Pradeepika
    Verma, Anshul
    Pal, Sukomal
    EXPERT SYSTEMS, 2022, 39 (06)
  • [49] PERSONALIZED VIDEO SUMMARIZATION BASED ON GROUP SCORING
    Darabi, Kaveh
    Ghinea, Gheorghita
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 310 - 314
  • [50] Video Summarization Based on ListNet Scoring Mechanism
    Wu Guangli
    Li Leiting
    Guo Zhenzhou
    Wang Chengxiang
    Yao Yanpeng
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 281 - 285