Effective aggregation of various summarization techniques

被引:31
|
作者
Mehta, Parth [1 ]
Majumder, Prasenjit [2 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol, Near Indroda Circle, Gandhinagar 382007, Gujarat, India
[2] Phirlthhai Ambani Inst informat & Commun Technol, 4209\ FB-4,DA IICT,Near Indroda Circle, Gandhinagar 382007, Gujarat, India
关键词
Summarization; Ensemble; SENTENCE; EXTRACTION;
D O I
10.1016/j.ipm.2017.11.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A large number of extractive summarization techniques have been developed in the past decade, but very few enquiries have been made as to how these differ from each other or what are the factors that actually affect these systems. Such meaningful comparison if available can be used to create a robust ensemble of these approaches, which has the possibility to consistently outperform each individual summarization system. In this work we examine the roles of three principle components of an extractive summarization technique: sentence ranking algorithm, sentence similarity metric and text representation scheme. We show that using a combination of several different sentence similarity measures, rather than only one, significantly improves performance of the resultant meta-system. Even simple ensemble techniques, when used in an informed manner, prove to be very effective in improving the overall performance and consistency of summarization systems. A statistically significant improvement of about 5% to 10% in ROUGE-1 recall was achieved by aggregating various sentence similarity measures. As opposed to this aggregation of several ranking algorithms did not show a significant improvement in ROUGE score, but even in this case the resultant meta-systems were more robust than candidate systems. The results suggest that new extractive summarization techniques should particularly focus on defining a better sentence similarity metric and use multiple sentence similarity scores and ranking algorithms in favour of a particular combination.
引用
收藏
页码:145 / 158
页数:14
相关论文
共 50 条
  • [21] The Study of Effective video summarization algorithm
    Ma, Donglin
    Zhang, Xijun
    Mi, Qian
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2364 - 2368
  • [22] Effective Replays and Summarization of Virtual Experiences
    Ponto, Kevin
    Kohlmann, Joe
    Gleicher, Michael
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (04) : 607 - 616
  • [23] Hierarchical Summarization Techniques for Network Traffic
    Mahmood, A. N.
    Leckie, C.
    Islam, R.
    Tari, Z.
    2011 6TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2011, : 2474 - 2479
  • [24] AHP TECHNIQUES FOR PERSIAN TEXT SUMMARIZATION
    Tofighy, Seyyed Mohsen
    Raj, Ram Gopal
    Javadi, Hamid Haj Seyyed
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2013, 26 (01) : 1 - 8
  • [25] A Comparative Survey of Text Summarization Techniques
    Watanangura P.
    Vanichrudee S.
    Minteer O.
    Sringamdee T.
    Thanngam N.
    Siriborvornratanakul T.
    SN Computer Science, 5 (1)
  • [26] A Summarization on PAPR Techniques for OFDM Systems
    Elavarasan P.
    Nagarajan G.
    Journal of The Institution of Engineers (India): Series B, 2015, 96 (04) : 381 - 389
  • [27] Experiments on Static Data Summarization Techniques
    Gandhi, Kalgi
    Pandat, Ami
    Bhise, Minal
    2021 IEEE INTERNATIONAL WOMEN IN ENGINEERING (WIE) CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2022, : 17 - 20
  • [28] Summarization on the Techniques of Testing Electronical System
    Du Min-Jie
    Ai Jin-Yan
    Liu Li-Min
    Zhu Sai
    MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 6437 - 6440
  • [29] An Effective Joint Framework for Document Summarization
    Gui, Min
    Zhang, Zhengkun
    Yang, Zhenglu
    Gu, Yanhui
    Xu, Guandong
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 121 - 122
  • [30] Comparison of Multi Document Summarization Techniques
    Nedunchelian, R.
    Muthucumarasamy, R.
    Saranathan, E.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (03): : 155 - 160