A preference learning approach to sentence ordering for multi-document summarization

被引:22
|
作者
Bollegala, Danushka [1 ]
Okazaki, Naoaki [1 ]
Ishizuka, Mitsuru [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, Tokyo 1138656, Japan
关键词
Sentence ordering; Multi-document summarization; Natural language processing;
D O I
10.1016/j.ins.2012.06.015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ordering information is a difficult but an important task for applications generating natural-language texts such as multi-document summarization, question answering, and concept-to-text generation. In multi-document summarization, information is selected from a set of source documents. Therefore, the optimal ordering of those selected pieces of information to create a coherent summary is not obvious. Improper ordering of information in a summary can both confuse the reader and deteriorate the readability of the summary. Therefore, it is vital to properly order the information in multi-document summarization. We model the problem of sentence ordering in multi-document summarization as a one of learning the optimal combination of preference experts that determine the ordering between two given sentences. To capture the preference of a sentence against another sentence, we define five preference experts: chronology, probabilistic, topical-closeness, precedence, and succession. We use summaries ordered by human annotators as training data to learn the optimal combination of the different preference experts. Finally, the learnt combination is applied to order sentences extracted in a multi-document summarization system. The proposed sentence ordering algorithm considers pairwise comparisons between sentences to determine a total ordering, using a greedy search algorithm, thereby avoiding the combinatorial time complexity typically associated with total ordering tasks. This enables us to efficiently order sentences in longer summaries, thereby rendering the proposed approach useable in real-world text summarization systems. We evaluate the sentence orderings produced by the proposed method and numerous other baselines using both semi-automatic evaluation measures as well as performing a subjective evaluation. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:78 / 95
页数:18
相关论文
共 50 条
  • [1] A topic Approach to Sentence Ordering for Multi-document Summarization
    Na, Liu
    Peng, Xiao
    Ying, Lu
    Tang Xiao-jun
    Wang Hai-wen
    Li Ming-xia
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1390 - 1395
  • [2] A bottom-up approach to sentence ordering for multi-document summarization
    Bollegala, Danushka
    Okazaki, Naoaki
    Ishizuka, Mitsuru
    INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (01) : 89 - 109
  • [3] A Bottom-up Approach to Sentence Ordering for Multi-document Summarization
    Bollegala, Danushka
    Okazaki, Naoaki
    Ishizuka, Mitsuru
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 385 - 392
  • [4] An adjacency model for sentence ordering in multi-document summarization
    Nie, Yu
    Ji, Donghong
    Yang, Lingpeng
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 313 - 322
  • [5] A hybrid sentence ordering strategy in multi-document summarization
    He, Yanxiang
    Liu, Dexi
    Yang, Hua
    Ji, Donghong
    Teng, Chong
    Qi, Wenqing
    WEB INFORMATION SYSTEMS - WISE 2006, PROCEEDINGS, 2006, 4255 : 339 - 349
  • [6] Cohesion-based Sentence Ordering for Multi-document Summarization
    Jiang, Xiaoyu
    2016 INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING AND COMMUNICATIONS TECHNOLOGY (IECT 2016), 2016, : 78 - 83
  • [7] A hybrid model for sentence ordering in extractive multi-document summarization
    Liu, Dexi
    Zhang, Zengchang
    He, Yanxiang
    Ji, Donghong
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 588 - 592
  • [8] Multi-document summarization sentence ordering algorithm using semantic analysis
    Ji, Min
    Liao, Junbi
    Lei, Jingfa
    Yuan, Zhongfan
    Advances in Information Sciences and Service Sciences, 2012, 4 (14): : 125 - 131
  • [9] MRS for multi-document summarization by sentence extraction
    Yong-Dong Xu
    Xiao-Dong Zhang
    Guang-Ri Quan
    Ya-Dong Wang
    Telecommunication Systems, 2013, 53 : 91 - 98
  • [10] MRS for multi-document summarization by sentence extraction
    Xu, Yong-Dong
    Zhang, Xiao-Dong
    Quan, Guang-Ri
    Wang, Ya-Dong
    TELECOMMUNICATION SYSTEMS, 2013, 53 (01) : 91 - 98