A Comparison of Multiple Approaches for the Extractive Summarization of Portuguese Texts

被引：0

作者：

Costa, Miguel ^{[1
]}

Martins, Bruno ^{[1
]}

机构：

[1] Univ Lisbon, Inst Super Tecn, INESC ID, P-1699 Lisbon, Portugal

来源：

LINGUAMATICA | 2015年 / 7卷 / 01期

关键词：

Automatic Summarization; Comparative Evaluation;

D O I：

暂无

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Automatic document summarization is the task of automatically generating condensed versions of source texts, presenting itself as one of the fundamental problems in the areas of Information Retrieval and Natural Language Processing. In this paper, different extractive approaches are compared in the task of summarizing individual documents corresponding to journalistic texts written in Portuguese. Through the use of the ROUGE package for measuring the quality of the produced summaries, we report on results for two different experimental domains, involving (i) the generation of headlines for news articles written in European Portuguese, and (ii) the generation of summaries for news articles written in Brazilian Portuguese. The results demonstrate that methods based on the selection of the first sentences have the best results when building extractive news headlines in terms of several ROUGE metrics. Regarding the generation of summaries with more than one sentence, the method that achieved the best results was the LSA Squared algorithm, for the various ROUGE metrics.

引用

页码：23 / 40

页数：18

共 50 条

[21] Extractive Summarization of Call Transcripts
Biswas, Pratik K.
Iakubovich, Aleksandr
IEEE ACCESS, 2022, 10 : 119826 - 119840
[22] Unsupervised Extractive Summarization with BERT
Dutulescu, Andreea -Nicoleta
Dascalu, Mihai
Ruseti, Stefan
2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 158 - 164
[23] A Survey on Extractive Text Summarization
Moratanch, N.
Chitrakala, S.
2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 265 - 270
[24] Hybrid MemNet for Extractive Summarization
Singh, Abhishek Kumar
Gupta, Manish
Varma, Vasudeva
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2303 - 2306
[25] Knowledge Distillation on Extractive Summarization
Lin, Ying-Jia
Tan, Daniel
Chou, Tzu-Hsuan
Kao, Hung-Yu
Wang, Hsin-Yang
2020 IEEE THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE 2020), 2020, : 71 - 76
[26] Accurate XML Summarization - A Comparison of Differing Approaches
Moraes Filho, Jose de Aguiar
Haerder, Theo
DATABASES AND INFORMATION SYSTEMS V, 2009, 187 : 29 - 40
[27] Experimental analysis of multiple criteria for extractive multi-document text summarization
Sanchez-Gomez, Jesus M.
Vega-Rodriguez, Miguel A.
Perez, Carlos J.
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
[28] Update Summarization for Portuguese
Asevedo Nobrega, Fernando Antonio
Salgueiro Pardo, Thiago Alexandre
2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2017, : 348 - 353
[29] Extractive-Abstractive Summarization of Judgment Documents Using Multiple Attention Networks
Gao, Yan
Liu, Zhengtao
Li, Juan
Guo, Fan
Xiao, Fei
LOGIC AND ARGUMENTATION, CLAR 2021, 2021, 13040 : 486 - 494
[30] Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization
Zhang, Shiyue
Wan, David
Bansal, Mohit
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2153 - 2174

← 1 2 3 4 5 →