Towards Coherent Single-Document Summarization: An Integer Linear Programming-based Approach

Cited by: 3
Authors
Garcia, Rodrigo [1]
Lima, Rinaldo [1]
Espinasse, Bernard [2]
Oliveira, Hilario [3]
Affiliations
[1] Univ Fed Rural Pernambuco, Recife, PE, Brazil
[2] Aix Marseille Univ, LSIS, UMR, CNRS, Marseille, France
[3] Univ Fed Pernambuco, Recife, PE, Brazil
Keywords
Single-document Summarization; Extractive Summarization; Coherence; Entity Graph; Integer Linear Programming
DOI
10.1145/3167132.3167211
Chinese Library Classification number
TP301 [Theory and Methods]
Discipline classification code
081202
Abstract
Automatic Text Summarization (ATS) is a viable option to reduce the content of textual documents, e.g., as a possible preprocessing step in many text mining applications. Single-document extractive summarizers have been developed based on different approaches, but many of them have the drawback of producing summaries with low coherence among the selected sentences in the generated summaries. In this paper, we present an unsupervised summarization system as an attempt towards coherent extractive single-document summarization. This system relies on Integer Linear Programming (ILP) as an optimization technique for selecting the smallest subset of sentences of a document maximizing the coverage of relevant concepts. Furthermore, our solution uses a graph-based algorithm for two goals: representing both sentences and concepts and enabling local coherence scoring among the sentences in the generated summaries. The proposed system is evaluated on two single-document benchmark datasets (DUC 2001-2002) using ROUGE measures, and compared with other state-of-the-art summarizers. The achieved results are very competitive.
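The sentence-selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation: it assumes a plain concept-coverage objective with a word-length budget, leaves out the entity-graph coherence term, and solves the 0/1 problem by exhaustive enumeration instead of an ILP solver. The function name `select_sentences`, the concept weights, and the toy sentences are all hypothetical.

```python
from itertools import combinations


def select_sentences(sentences, concept_weights, budget):
    """Exhaustively solve a tiny 0/1 concept-coverage selection problem.

    sentences: list of (length_in_words, set_of_concepts) pairs
    concept_weights: dict mapping concept -> relevance weight
    budget: maximum total length of the selected sentences
    Returns (chosen_indices, covered_weight).
    """
    best_indices, best_score = (), 0.0
    n = len(sentences)
    # Enumerate subsets from smallest to largest so that, among equally
    # scoring subsets, the one with the fewest sentences is kept.
    for size in range(1, n + 1):
        for indices in combinations(range(n), size):
            total_length = sum(sentences[i][0] for i in indices)
            if total_length > budget:
                continue  # violates the length constraint
            covered = set().union(*(sentences[i][1] for i in indices))
            # Each covered concept counts once, however often it occurs.
            score = sum(concept_weights.get(c, 0.0) for c in covered)
            if score > best_score:
                best_indices, best_score = indices, score
    return best_indices, best_score


# Hypothetical toy document: (length in words, concepts mentioned).
sentences = [
    (10, {"ilp", "summary"}),
    (8, {"coherence", "graph"}),
    (12, {"ilp", "coherence", "summary"}),
    (6, {"rouge"}),
]
# Hypothetical concept weights (assumed, not taken from the paper).
weights = {"ilp": 3.0, "summary": 2.0, "coherence": 2.5, "graph": 1.0, "rouge": 0.5}

chosen, score = select_sentences(sentences, weights, budget=20)
print(chosen, score)
```

Enumeration is exponential in the number of sentences, so it only serves to make the objective and the constraint concrete; a realistic system would pass the same binary variables to an ILP solver.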
Pages: 712-719
Number of pages: 8