Heuristic Initialization And Similarity Integration Based Model for Improving Extractive Multi-Document Summarization

被引:0
|
作者
Kadhim, Nasreen J. [1 ]
Mohammed, Dheyaa Abdulameer [2 ]
机构
[1] Univ Baghdad, Dept Comp Sci, Coll Sci, Baghdad, Iraq
[2] Imam Jaafar Al Sadiq Univ, Baghdad, Iraq
关键词
Heuristic Initialization; integrations of similarity measures; Gisting Evaluation (ROUGE); optimization based model;
D O I
10.26782/jmcms.2019.10.00025
中图分类号
O3 [力学];
学科分类号
08 ; 0801 ;
摘要
Currently, the prominence of automatic multi document summarization task belongs to the information rapid increasing on the Internet. Automatic document summarization technology is progressing and may offer a solution to the problem of information overload. Automatic text summarization system has the challenge of producing high quality summary. In this paper, the design of generic text summarization model based on sentence extraction has been redirected into more semantic measure reflecting the two significant objectives: content coverage and diversity when generating summaries from multiple documents as an explicit optimization model. The proposed two models have been then coupled and defined as single-objective optimization problem. Also, different integrations of similarity measures have been introduced and applied to the proposed model in addition to the single similarity measure that bases on using Cosine, Dice and Jaccard similarity measures for measuring text similarity involving integrating double similarity measures and triple similarity measures. The proposed optimization model has been solved using Genetic Algorithm. Moreover, heuristic initialization has been proposed and injected into the adopted evolutionary algorithm to harness its strength. Document sets supplied by Document Understanding Conference 2002 (DUC2002) have been used for the proposed system as an evaluation dataset and as an evaluation metric, Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit has been used for performance evaluation of the proposed method and for performance comparison against other baseline systems. Comparison results for the proposed optimization based model against other baselines verified that the proposed system outperforms other baseline approaches in terms of Rouge - 2 and Rouge -1 scores wherein it has recorded a score of 0.4542 for Rouge - 1 and 0.1623 for Rouge - 2.
引用
收藏
页码:330 / 350
页数:21
相关论文
共 50 条
  • [1] Improving the Similarity Measure of Determinantal Point Processes for Extractive Multi-Document Summarization
    Cho, Sangwoo
    Lebanoff, Logan
    Foroosh, Hassan
    Liu, Fei
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1027 - 1038
  • [2] A hybrid model for sentence ordering in extractive multi-document summarization
    Liu, Dexi
    Zhang, Zengchang
    He, Yanxiang
    Ji, Donghong
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 588 - 592
  • [3] Extractive Multi-document Text Summarization Leveraging Hybrid Semantic Similarity Measures
    Bandaru, Rajesh
    Radhika, Dr. Y.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 844 - 852
  • [4] Multi-document extractive text summarization based on firefly algorithm
    Tomer, Minakshi
    Kumar, Manoj
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6057 - 6065
  • [5] Chinese Multi-document Summarization Based on Opinion Similarity
    Liu, Rui
    An, Yi
    Song, Lang
    [J]. CEIS 2011, 2011, 15
  • [6] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110
  • [7] Extractive multi-document text summarization based on graph independent sets
    Uckan, Taner
    Karci, Ali
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (03) : 145 - 157
  • [8] Grapharizer: A Graph-Based Technique for Extractive Multi-Document Summarization
    Jalil, Zakia
    Nasir, Muhammad
    Alazab, Moutaz
    Nasir, Jamal
    Amjad, Tehmina
    Alqammaz, Abdullah
    [J]. ELECTRONICS, 2023, 12 (08)
  • [9] Extractive multi-document summarization using multilayer networks
    Tohalino, Jorge V.
    Amancio, Diego R.
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 503 : 526 - 539
  • [10] Unsupervised Framework for Comment-based Multi-document Extractive Summarization
    Roha, Vishal Singh
    Saini, Naveen
    Saha, Sriparna
    Moreno, Jose G.
    [J]. PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 574 - 582