Effective aggregation of various summarization techniques

被引:31
|
作者
Mehta, Parth [1 ]
Majumder, Prasenjit [2 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol, Near Indroda Circle, Gandhinagar 382007, Gujarat, India
[2] Phirlthhai Ambani Inst informat & Commun Technol, 4209\ FB-4,DA IICT,Near Indroda Circle, Gandhinagar 382007, Gujarat, India
关键词
Summarization; Ensemble; SENTENCE; EXTRACTION;
D O I
10.1016/j.ipm.2017.11.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A large number of extractive summarization techniques have been developed in the past decade, but very few enquiries have been made as to how these differ from each other or what are the factors that actually affect these systems. Such meaningful comparison if available can be used to create a robust ensemble of these approaches, which has the possibility to consistently outperform each individual summarization system. In this work we examine the roles of three principle components of an extractive summarization technique: sentence ranking algorithm, sentence similarity metric and text representation scheme. We show that using a combination of several different sentence similarity measures, rather than only one, significantly improves performance of the resultant meta-system. Even simple ensemble techniques, when used in an informed manner, prove to be very effective in improving the overall performance and consistency of summarization systems. A statistically significant improvement of about 5% to 10% in ROUGE-1 recall was achieved by aggregating various sentence similarity measures. As opposed to this aggregation of several ranking algorithms did not show a significant improvement in ROUGE score, but even in this case the resultant meta-systems were more robust than candidate systems. The results suggest that new extractive summarization techniques should particularly focus on defining a better sentence similarity metric and use multiple sentence similarity scores and ranking algorithms in favour of a particular combination.
引用
收藏
页码:145 / 158
页数:14
相关论文
共 50 条
  • [11] Video Summarization: Techniques and Classification
    Ajmal, Muhammad
    Ashraf, Muhammad Husnain
    Shakir, Muhammad
    Abbas, Yasir
    Shah, Faiz Ali
    COMPUTER VISION AND GRAPHICS, 2012, 7594 : 1 - 13
  • [12] A Survey on Abstractive Summarization Techniques
    Rachabathuni, Pavan Kartheek
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 762 - 765
  • [13] Improving Abstractive Text Summarization with History Aggregation
    Liao, Pengcheng
    Zhang, Chuang
    Chen, Xiaojun
    Zhou, Xiaofei
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [14] CTRAS: Crowdsourced Test Report Aggregation and Summarization
    Hao, Rui
    Feng, Yang
    Jones, James A.
    Li, Yuying
    Chen, Zhenyu
    2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, : 900 - 911
  • [15] Study of Various Text Summarization Methods
    Khan, Sarim
    Pathak, Abhay
    Chopra, Rishabh
    Parihar, Hemant Singh
    Kaur, Preet Chandan
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 115 - 126
  • [16] Comparison of Sentiment Analysis on Auto-Summarized Text & Original Text using various Summarization Techniques
    Kandpal, Prathamesh
    Wadkar, Yash
    Attri, Harsh
    Bhorge, Siddharth
    2020 IEEE PUNE SECTION INTERNATIONAL CONFERENCE (PUNECON), 2020, : 206 - 211
  • [17] A Survey of Security Concerns in Various Data Aggregation Techniques in Wireless Sensor Networks
    Kumar, Mukesh
    Dutta, Kamlesh
    INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, 2015, 309 : 1 - 15
  • [18] A Survey of Unstructured Text Summarization Techniques
    Elfayoumy, Sherif
    Thoppil, Jenny
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (04) : 149 - 154
  • [20] Effective summarization method of text documents
    Alguliev, RM
    Aliguliyev, RM
    2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2005, : 264 - 271