Text Summarization as a Multi-objective Optimization Task: Applying Harmony Search to Extractive Multi-Document Summarization

Cited by: 3
Authors
Bidoki, M. [1 ]
Fakhrahmad, M. [1 ]
Moosavi, M. R. [1 ]
Affiliation
[1] Shiraz Univ, Sch Elect & Comp Engn, Dept Comp Sci & Engn, Shiraz 7134851154, Iran
Source
COMPUTER JOURNAL | 2022 / Volume 65 / Issue 05
Keywords
multi-document extractive summarization; optimization problem; harmony search algorithm; sentence expansion; conceptual density tuning; language-independent approach; word embedding; graph-based ranking; objective function learning; text clustering; SENTENCE SCORING TECHNIQUES; MAXIMUM COVERAGE; ALGORITHM; REDUNDANCY; FRAMEWORK; SINGLE;
DOI
10.1093/comjnl/bxaa139
CLC number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Automated extractive text summarization is now one of the most common techniques for organizing information. In extractive summarization, the most appropriate sentences are selected from the text to build a representative summary, so finding the best sentences is a fundamental task. This paper treats extractive summarization as a multi-objective optimization problem and proposes a language-independent, semantic-aware approach that applies the harmony search algorithm to generate multi-document summaries. The approach learns the objective function from an additional set of reference summaries and then generates the best summaries according to the trained function. The system also performs several supplementary steps to improve results. It expands sentences with a novel method that tunes their conceptual densities towards important topics. Furthermore, a new clustering method is introduced for identifying important topics and reducing redundancy, and a sentence placement policy based on the Hamiltonian shortest path is introduced for producing readable summaries. Experiments were conducted on the DUC2002, DUC2006 and DUC2007 datasets. The results showed that the proposed framework could assist the summarization process and yield better performance, and that it generally outperformed the other cited summarization systems.
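The core idea in the abstract, selecting sentences by harmony search, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the binary encoding, the bit-flip pitch adjustment, the repair step, the parameter values (`hmcr`, `par`, `memory_size`) and the hand-written `toy_score` function are all assumptions made for illustration. In particular, the paper learns its objective function from reference summaries, whereas the toy objective below simply rewards word coverage and penalizes repeated words.

```python
import random


def harmony_search(score, n_items, max_selected, memory_size=10,
                   hmcr=0.9, par=0.1, iterations=2000, seed=0):
    """Binary harmony search: maximize score(bits) with at most max_selected ones.

    All parameter names and defaults are illustrative assumptions.
    """
    rng = random.Random(seed)
    max_selected = min(max_selected, n_items)

    def random_solution():
        # Pick between 1 and max_selected sentences uniformly at random.
        k = rng.randint(1, max_selected)
        chosen = set(rng.sample(range(n_items), k))
        return [1 if i in chosen else 0 for i in range(n_items)]

    def repair(bits):
        # Enforce the length budget by dropping random selected sentences.
        on = [i for i, b in enumerate(bits) if b]
        while len(on) > max_selected:
            bits[on.pop(rng.randrange(len(on)))] = 0
        return bits

    # Harmony memory: a small pool of candidate summaries and their scores.
    memory = [random_solution() for _ in range(memory_size)]
    fitness = [score(h) for h in memory]

    for _ in range(iterations):
        new = []
        for i in range(n_items):
            if rng.random() < hmcr:
                bit = rng.choice(memory)[i]      # memory consideration
                if rng.random() < par:
                    bit = 1 - bit                # pitch adjustment (bit flip)
            else:
                bit = rng.randint(0, 1)          # random selection
            new.append(bit)
        new = repair(new)
        value = score(new)
        # Replace the worst harmony if the new one is better.
        worst = min(range(memory_size), key=fitness.__getitem__)
        if value > fitness[worst]:
            memory[worst], fitness[worst] = new, value

    best = max(range(memory_size), key=fitness.__getitem__)
    return memory[best], fitness[best]


# Hypothetical toy data: each sentence is reduced to a set of content words.
sentences = [
    {"harmony", "search", "summarization"},
    {"harmony", "search", "optimization"},
    {"clustering", "redundancy"},
    {"sentence", "ordering", "readability"},
]


def toy_score(bits):
    # Coverage of distinct words minus a penalty for repeated words.
    picked = [s for s, b in zip(sentences, bits) if b]
    covered = set().union(*picked) if picked else set()
    total = sum(len(s) for s in picked)
    return len(covered) - 0.5 * (total - len(covered))


best_bits, best_value = harmony_search(toy_score, len(sentences), max_selected=2)
print(best_bits, best_value)
```

The returned bit vector marks which sentences enter the summary. Swapping `toy_score` for a learned or multi-objective scoring function would not change the search loop itself.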
Pages: 1053 - 1072
Page count: 20
Related papers
50 in total
  • [1] Parallelizing a multi-objective optimization approach for extractive multi-document text summarization
    Sanchez-Gomez, Jesus M.
    Vega-Rodriguez, Miguel A.
    Perez, Carlos J.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 134 : 166 - 179
  • [2] A decomposition-based multi-objective optimization approach for extractive multi-document text summarization
    Sanchez-Gomez, Jesus M.
    Vega-Rodriguez, Miguel A.
    Perez, Carlos J.
    [J]. APPLIED SOFT COMPUTING, 2020, 91
  • [3] Extractive multi-document text summarization using a multi-objective artificial bee colony optimization approach
    Sanchez-Gomez, Jesus M.
    Vega-Rodriguez, Miguel A.
    Perez, Carlos J.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 159 : 1 - 8
  • [4] An Indicator-based Multi-Objective Optimization Approach Applied to Extractive Multi-Document Text Summarization
    Sanchez-Gomez, J.
    Vega-Rodriguez, M.
    Perez, C.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2019, 17 (08) : 1291 - 1299
  • [5] Multi-document Summarization using Evolutionary Multi-objective Optimization
    Jung, Chihoon
    Datta, Rituparna
    Segev, Aviv
    [J]. PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017 : 31 - 32
  • [6] Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
    Alqaisi, Rana
    Ghanem, Wasel
    Qaroush, Aziz
    [J]. IEEE ACCESS, 2020, 8 : 228206 - 228224
  • [7] Decomposition-based multi-objective differential evolution for extractive multi-document automatic text summarization
    Wahab, Muhammad Hafizul Hazmi
    Hamid, Nor Asilah Wati Abdul
    Subramaniam, Shamala
    Latip, Rohaya
    Othman, Mohamed
    [J]. APPLIED SOFT COMPUTING, 2024, 151
  • [8] Extractive Multi-Document Text Summarization by Using Binary Particle Swarm Optimization
    Potnurwar, Archana
    Pimpalshende, Anjusha
    Aote, Shailendra S.
    Bongirwar, Vrusbali
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14) : 32 - 34
  • [9] Survey on Extractive Text Summarization Methods with Multi-Document Datasets
    Varalakshmi, P. N. K.
    Kallimani, Jagadish S.
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018 : 2113 - 2119
  • [10] Multi-document extractive text summarization based on firefly algorithm
    Tomer, Minakshi
    Kumar, Manoj
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6057 - 6065