Towards Coherent Single-Document Summarization: An Integer Linear Programming-based Approach

被引:3
|
作者
Garcia, Rodrigo [1 ]
Lima, Rinaldo [1 ]
Espinasse, Bernard [2 ]
Oliveira, Hilario [3 ]
机构
[1] Univ Fed Rural Pernambuco, Recife, PE, Brazil
[2] Aix Marseille Univ, LSIS, UMR, CNRS, Marseille, France
[3] Univ Fed Pernambuco, Recife, PE, Brazil
关键词
Single-document Summarization; Extractive Summarization; Coherence; Entity Graph; Integer Linear Programming;
D O I
10.1145/3167132.3167211
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic Text Summarization (ATS) is a viable option to reduce the content of textual documents, e.g., as a possible preprocessing step in many text mining applications. Single-document extractive summarizers have been developed based on different approaches, but many of them have the drawback of producing summaries with low coherence among the selected sentences in the generated summaries. In this paper, we present an unsupervised summarization system as an attempt towards coherent extractive single-document summarization. This system relies on Integer Linear Programming (ILP) as an optimization technique for selecting the smallest subset of sentences of a document maximizing the coverage of relevant concepts. Furthermore, our solution uses a graph-based algorithm for two goals: representing both sentences and concepts and enabling local coherence scoring among the sentences in the generated summaries. The proposed system is evaluated on two single-document benchmark datasets (DUC 2001-2002) using ROUGE measures, and compared with other state-of-the-art summarizers. The achieved results are very competitive.
引用
收藏
页码:712 / 719
页数:8
相关论文
共 50 条
  • [1] A Concept-based Integer Linear Programming Approach for Single-Document Summarization
    Oliveira, Hilario
    Lima, Rinaldo
    Lins, Rafael Dueire
    Freitas, Fred
    Riss, Marcelo
    Simske, Steven J.
    [J]. PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), 2016, : 403 - 408
  • [2] A Regression-based Approach using Integer Linear Programming for Single-document Summarization
    Oliveira, Hilario
    Lins, Rafael Dueire
    Lima, Rinaldo
    Freitas, Fred
    Simske, Steven J.
    [J]. 2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 270 - 277
  • [3] Learning-Based Single-Document Summarization with Compression and Anaphoricity Constraints
    Durrett, Greg
    Berg-Kirkpatrick, Taylor
    Klein, Dan
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1998 - 2008
  • [4] Extraction-based single-document summarization using random indexing
    Chattejee, Niladri
    Mohan, Shiwali
    [J]. 19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, : 448 - +
  • [5] Progress in linear programming-based algorithms for integer programming: An exposition
    Johnson, EL
    Nemhauser, GL
    Savelsbergh, MWP
    [J]. INFORMS JOURNAL ON COMPUTING, 2000, 12 (01) : 2 - 23
  • [6] Hybrid-based Arabic Single-Document Text Summarization Approach Using Genatic Algorithm
    Jaradat, Yazan A.
    Al-Taani, Ahmad T.
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2016, : 85 - 91
  • [7] Abstractive Multi-Document Summarization via Joint Learning with Single-Document Summarization
    Jin, Hanqi
    Wan, Xiaojun
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2545 - 2554
  • [8] SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling
    Bao, Forrest Sheng
    Luo, Ge
    Li, Hebi
    Qiu, Minghui
    Yang, Yinfei
    He, Youbiao
    Chen, Cen
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 2450 - 2458
  • [9] Extractive single-document summarization based on genetic operators and guided local search
    Mendoza, Martha
    Bonilla, Susana
    Noguera, Clara
    Cobos, Carlos
    Leon, Elizabeth
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (09) : 4158 - 4169
  • [10] Analyzing Preprocessing Settings for Urdu Single-document Extractive Summarization
    Humayoun, Muhammad
    Yu, Hwanjo
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3686 - 3693