A Comparison of Multiple Approaches for the Extractive Summarization of Portuguese Texts

被引:0
|
作者
Costa, Miguel [1 ]
Martins, Bruno [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, P-1699 Lisbon, Portugal
来源
LINGUAMATICA | 2015年 / 7卷 / 01期
关键词
Automatic Summarization; Comparative Evaluation;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Automatic document summarization is the task of automatically generating condensed versions of source texts, presenting itself as one of the fundamental problems in the areas of Information Retrieval and Natural Language Processing. In this paper, different extractive approaches are compared in the task of summarizing individual documents corresponding to journalistic texts written in Portuguese. Through the use of the ROUGE package for measuring the quality of the produced summaries, we report on results for two different experimental domains, involving (i) the generation of headlines for news articles written in European Portuguese, and (ii) the generation of summaries for news articles written in Brazilian Portuguese. The results demonstrate that methods based on the selection of the first sentences have the best results when building extractive news headlines in terms of several ROUGE metrics. Regarding the generation of summaries with more than one sentence, the method that achieved the best results was the LSA Squared algorithm, for the various ROUGE metrics.
引用
收藏
页码:23 / 40
页数:18
相关论文
共 50 条
  • [21] Extractive Summarization of Call Transcripts
    Biswas, Pratik K.
    Iakubovich, Aleksandr
    IEEE ACCESS, 2022, 10 : 119826 - 119840
  • [22] Unsupervised Extractive Summarization with BERT
    Dutulescu, Andreea -Nicoleta
    Dascalu, Mihai
    Ruseti, Stefan
    2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 158 - 164
  • [23] A Survey on Extractive Text Summarization
    Moratanch, N.
    Chitrakala, S.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 265 - 270
  • [24] Hybrid MemNet for Extractive Summarization
    Singh, Abhishek Kumar
    Gupta, Manish
    Varma, Vasudeva
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2303 - 2306
  • [25] Knowledge Distillation on Extractive Summarization
    Lin, Ying-Jia
    Tan, Daniel
    Chou, Tzu-Hsuan
    Kao, Hung-Yu
    Wang, Hsin-Yang
    2020 IEEE THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE 2020), 2020, : 71 - 76
  • [26] Accurate XML Summarization - A Comparison of Differing Approaches
    Moraes Filho, Jose de Aguiar
    Haerder, Theo
    DATABASES AND INFORMATION SYSTEMS V, 2009, 187 : 29 - 40
  • [27] Experimental analysis of multiple criteria for extractive multi-document text summarization
    Sanchez-Gomez, Jesus M.
    Vega-Rodriguez, Miguel A.
    Perez, Carlos J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
  • [28] Update Summarization for Portuguese
    Asevedo Nobrega, Fernando Antonio
    Salgueiro Pardo, Thiago Alexandre
    2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2017, : 348 - 353
  • [29] Extractive-Abstractive Summarization of Judgment Documents Using Multiple Attention Networks
    Gao, Yan
    Liu, Zhengtao
    Li, Juan
    Guo, Fan
    Xiao, Fei
    LOGIC AND ARGUMENTATION, CLAR 2021, 2021, 13040 : 486 - 494
  • [30] Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization
    Zhang, Shiyue
    Wan, David
    Bansal, Mohit
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2153 - 2174